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TITLE OF THE INVENTION 

ORTHOGONAL GENE SWITCHES 



5 CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefits of U.S. Provisional Applications Serial 
Number 60/514,362 filed October 24, 2003. 

FIELD OF THE INVENTION 
10 The present invention relates to novel gene switches that do not interfere with normal 

functions of endogenous nuclear receptors. 

BACKGROUND OF THE INVENTION 

15 Nuclear hormone receptor superfamily is the largest known family of eukaryotic 

transcription regulators. The superfamily includes steroid hormones receptors, such as glucocorticoid 
receptors (GR), androgen receptors (AR), mineralocorticoid receptors (MR), progesterone receptors (PR), 
estrogen receptors (ER), and nonsteroid hormones receptors, such as thyroid hormone receptors (TR), 
vitamin D receptors (VDR), and retinoic acid receptors (RAR), as well as orphan receptors whose ligands 

20 have not been found. The hormones, via binding to the corresponding receptors, play important roles in 
the regulation of complex physiological events, including key steps in development, maintenance of 
homeostasis, cellular proliferation, differentiation, and death. 

Nuclear hormone receptor action has been elucidated in considerable detail in vertebrate systems 
at both the cellular and molecular levels. In the absence of ligand, nuclear receptors for steroid hormones 

25 are bound with Hsp90, Hsp70 and p59 to form inactive complexes. The complexes reside in the 

cytoplasm, except for the estrogen receptor complexes, which are present in the nucleus. Upon binding 
the hormone, the receptors release the Hsp90, Hsp70 and p59 molecules, and translocate to the nucleus. 
Once inside the nucleus the receptors form homodimers and bind to the hormone response elements 
(HREs) at the regulatory regions of the target genes, resulting in the activation or repression of the target 

30 genes. In contrast, nuclear receptors for nonsteroid hormones, are bound to their response elements in the 
form of heterodimers free of hsp proteins even in the absence of the hormones. The nuclear receptors are 
activated by binding nonsteroid hormones. 

While the hormones are structurally diverse compounds, their receptors are highly structurally 
related proteins. Nuclear hormone receptors are modular proteins organized into structurally and 

35 functionally defined domains, including amino-terminal region, DNA-binding domain (DBD), and 
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ligand-binding domain (LBD). The ligand (hormone) binding domain, having a length of about 300 
amino acids, is located in the carboxy-terminal half of the receptors. The ligand-binding domain appears 
to fold into a complex structure, creating a specific hydrophobic pocket that surrounds the ligand. The 
LBD also contains sequences responsible for receptor dimerization, hsp associations (for steroid hormone 
5 receptors), ligand-dependent transactivation function, silencing/repressor function (when LBD binds to 
antagonists) and nuclear translocation signal. 

Although gene therapy shows great promise for treatment of a variety of pathologies, its practical 
application is still hampered by a number of hurdles, one of which is the control of trans-gene expression 
in patients through the administration of an exogenous agent. The control system of trans-gene 
10 expression is often called "gene switch". 

Ideally, the exogenous agent of a gene switch should affect exclusively the activity of the trans- 
gene and trans-gene expression should not be affected by endogenous agents. 

Many systems commonly used in the regulation of eukaryotic gene expression include 
prokaryotic or other non-human components. These systems include tetracyciine-dependent system 
1 5 derived from R coli Tet-repressor (TetR), ecdy sone (Ec)-dependent system derived from Drosophila Ec 
receptor, and rapamycine-dependent system. Because of their non-human components, these systems are 
likely to be immunogenic in humans or other immunocompetent hosts. 

Nuclear hormone receptors and their ligands are suitable candidates for gene switches. The 
expression of nuclear hormone receptors from a species should not be immunogenic in the same species. 
20 For example, the expression of human estrogen receptor should not be immunogenic in a human host. 
The administration of exogenous ligands such as hormones, as well as the expression of exogenous 
nuclear hormone receptors, however, may interfere with the normal functions of endogenous nuclear 
hormone receptors. On the other hand, the exogenous nuclear hormone receptors may respond to 
endogenous hormones. 

25 In some instances, the ligand-binding domain of the nuclear hormone receptors were 

modified to decrease the mutual interference. For example, a chimeric transcription factor was 
constructed by fusing a carboxy-terminal deletion mutant of the ligand-binding domain (LBD) of 
the human progesterone receptor to GAL4 DNA-binding domain and VP16 activation domain. 
While not responsive to progesterone, the chimeric transcription factor was activated by RU486, 

30 a synthetic progesterone antagonist (Wang Y. et al. 1994, PNAS 91:8180-8184). However, the 
system include exogenous ligands which interfere with the activities of the corresponding 
nuclear hormone receptors. Hence, there is a need to develop novel gene switches that neither 
affect the normal functions of endogenous nuclear hormone receptors, nor are affected by 
endogenous hormones. 
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The references cited throughout the present application are not admitted to be prior art to 
the claimed invention. 

SUMMARY OF THE INVENTION 
5 The present invention provides a polypeptide that comprises, 

SAGDMRAANL WPSPLMIKRS KKNSLALSLT ADQMVSALLD 

AEPPILYSEY DPTRPFSEAS MMGLLTNLAX, RELVHMINWA 

KRVPGFVDLT LHDQVHLLEC AWMEILMIG^ VWRSMEHPGK 

LL^APNLLLD RNQGKCVEGX, VEX 5 FDMX 6 LAT SSRFRMMNLQ 
10 GEEFVCLKSI ILLNSGVYTF LSSTLKSLEE KDH IHRVLDK 

ITDTLIHLMA KAGLTLQQQH QRLAQLLLIL SHIRHMSNKX, 

MEX 8 LYSMKCK NWPLYDLLL EMLDAHRLHA PTSRGGASVE 

ETDQSHLATA GSTSSHSLQK YYITGEAEGF PATV 
in the peptide, X x = D or A; are each independently G, A, C, V, I, L; M, F, Y, or W; X 7 = G or R; and 
15 Xs = HorV. 

According to an embodiment of the present invention, in the polypeptide, X 2 = L, M, or 
V; X 3 = F or W; X4 = M, G or A; X 5 = I, M, V, or L; Xe = L. 

According to a preferred embodiment of the present invention, in the polypeptide, Xi = 
D; X 2 = L, M, or V; X 3 = F or W; X4 = M, G or A; X 5 = I, M, V, orL; X* = L; X 7 = G; and X 8 = H. 
20 According to an embodiment of the present invention, the polypeptide comprises the 

amino acid sequence of SEQ ID NO: 2. 

According to a preferred embodiment of the present invention, in the polypeptide, X! = 
D;X2 = L,M,orV;X 3 =ForW;X4 = GorA;X 5 = I,M, V, or L; Xs = L; X 7 = G; and X 8 = H. 

According to a further preferred embodiment of the present invention, the polypeptide 
25 comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 3-15. 

According to a preferred embodiment of the present invention, in the polypeptide, Xi = 
A;X 2 = L,M,orV;X 3 = ForW;X4 = Gor A;X 5 = I,M, V,orL;X 6 = L;X 7 = G; andXs = V. 

According to a further preferred embodiment of the present invention, the polypeptide 
comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 16-28. 
30 According to an embodiment of the present invention, in the polypeptide, Xi = D; X 2 = L, 

M, orV; X 3 = F or W; X4 = G or A; X 5 = I,M, V,orL;X6 = L;X 7 = R; and X 8 = H. 

According to a further preferred embodiment of the present invention, the polypeptide 
comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 29-41. 

The present invention provides a polynucleotide encoding the polypeptide. 
35 The present invention also provides a transcription factor that comprises a DNA-binding 

domain, a ligand-binding domain comprising the polypeptide, and a transcription regulatory domain. 
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According to an embodiment of the present invention, the ligand-binding domain of the 
transcription factor comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 
16-41. 

According to an embodiment of the present invention, the DNA-binding domain of the 
5 transcription factor is GAL4 minimal DNA-binding domain. 

According to a preferred embodiment of the present invention, the DNA-binding domain 
of the transcription factor is the DNA-binding domain of HNF-1. 

According to an embodiment of the present invention, the transcription regulatory 
domain of the transcription factor is VP16 minimal activation domain. 
10 According to an alternative embodiment of the present invention, the transcription 

regulatory domain of the transcription factor is a portion of the activation domain of human p65. 

According to a preferred embodiment of the present invention, the transcription factor 
comprises SEQ ID NO: 43. 

The present invention provides a polynucleotide encoding the transcription factor. 
15 The present invention also provides a host cell transformed with a composition 

comprising the transcription factor. 

The present invention also provides a compound that binds to and activates the 
transcription factor. 

According to a preferred embodiment of the present invention, the compound is selected 
20 from the group consisting of CMP1 and CMP4-38. 

The present invention also provides an orthogonal gene switch for regulating the 
expression of a desired gene. The gene switch comprises the transcription factor, and a vector comprising 
the desired gene, and a regulatory region that is fused to the desired gene. The transcription factor is 
capable of binding to the regulatory region. 
25 According to an embodiment of the present invention, the gene switch further comprises 

a compound that binds to the ligand-binding domain and activates the transcription factor. 

According to a preferred embodiment of the present invention, the ligand-binding domain 
of the gene switch comprises an amino acid sequence selected from the group consisting of SEQ ED NO: 
16-28, and the compound is selected from the group consisting of CMP1, CMP4, CMP5, and CMP1 1-38. 
30 According to a preferred embodiment of the present invention, the ligand-binding domain 

of the gene switch comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 
29-41, and the compound is selected from the group consisting of CMP6-10. 

The present invention further provides a method of making an orthogonal gene switch. 
The method comprises selecting a ligand-binding domain (LBD) from a nuclear hormone receptor, 
35 selecting an inactive analogue of the hormone; constructing a library of transcription factors comprising 
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veneered variants of the selected LBD, which are created by mutating amino acid residues that hinder the 
binding of the selected inactive analogue to various amino acid residues that might facilitate the binding; 
and screening the library with the selected inactive analogue to select the transcription factors that are 
activated by the inactive analogue. 
5 According to an embodiment of the present invention, the method of making an 

orthogonal gene switch further comprises introducing mutations into the veneered LBDs of the selected 
transcription factors to reduce their affinity to the hormone and the ligand-independent activity of the 
transcription factors. 

According to another embodiment of the present invention, the method further comprises 
10 making inactive analogues that are capable of activating the transcription factors carrying the mutations. 

According to an embodiment of the present invention, the nuclear hormone receptor used 
in the method is selected from the group consisting of estrogen receptor (ER), androgen receptor (AR), 
glucocorticoid receptor (GR), mineralocorticoid receptor (MR), progesterone receptor (PR), vitamin D 3 
receptor (VDR), thyroid hormone receptor (TR), and retinoic acid receptor (RAR). 
15 According to a preferred embodiment of the present invention, the nuclear hormone 

receptor used in the method is human estrogen receptor a (hERa). 

According to a further preferred embodiment of the present invention, the nuclear 
hormone receptor used in the method comprises SEQ ID NO: 2. 

According to an embodiment of the present invention, the inactive analogue used in the 
20 method is an inactive analogue of a nuclear hormone receptor-specific agonist or antagonist. 

According to an embodiment of the present invention, the inactive analogue used in the 
method is an inactive analogue of hERa-specific agonist or antagonist. 

According to a preferred embodiment of the present invention, the inactive analogue used 
in the method is an inactive analogue of a human estrogen receptor |3 (hER0)-specific agonist or 
25 antagonist 

According to a further preferred embodiment of the present invention, the inactive 
analogue is CMP1. 

According to a preferred embodiment of the present invention, the library used in the 
library is a yeast one hybrid system. 
30 According to a preferred embodiment of the present invention, the transcription factor 

used in the method further comprises a GAL4 minimal DNA-binding domain (DBD) and a VP16 minimal 
activation domain (AD). 

According to a further preferred embodiment of the present invention, the library of 
transcription factors used in the method contains veneered LBDs with their amino acid residues 391, 404, 
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421, 424, and 428 independently selected from the group consisting of Gly, Ala, Cys, Val, lie, Leu, Met, 
Phe, Tyr, and Tip. 

Other features and advantages of the present invention are apparent from the additional 
descriptions provided herein including the different examples. The provided examples illustrate different 
5 components and methodology useful in practicing the present invention. The examples do not limit the 
claimed invention. Based on the present disclosure the skilled artisan can identify and employ other 
components and methodology useful for practicing the present invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10 

Figures 1A and IB. The structures and binding specificities of estradiol and representative active (CMP 2 
and 3) and inactive analogues (CMP 1, 4-10). MG-LBD corresponds to the hERa -LBD with L(384)M 
andM(421)G. 

15 Figure 2A. Schematic representation of the GAL4DBD/hERa LBD/VP16AD chimeric transcription 

factors. Figures 2B and 2C. Transcriptional activation in yeast transformants expressing chimeras based 
on wt hERa-LBD (squares) or hERa-L(384)M-LBD (circles) by estradiol and the hERfi-specific 
compound CMP2. Dose-response curves of P-Galactosidase activity in the presence of increasing 
concentrations of E2 (Figure 2B) or CMP2 (Figure 2C) were performed. EC 50 values were determined as 

20 described in Material and Methods. 

Figure 3. The design of the hERa-LBD mutant library: molecular models of the three rotamers of CMP 1 
with solvent accessible surface in the crystal structure of the hERP binding pocket The five amino acid 
residues mutagenized in the library are highlighted. 

25 

Figure 4. DNA sequence analysis of plasmids rescued from the genetically-selected variants. The 
number of independent clones in which the same mutation array was found is indicated. The consensus 
mutation of M421 into a smaller amino acid (G or A) is consistent with the R9a benzyl substituent of 
CMP1 adopting rotamer conformation 1. 

30 

Figures 5A-5C. Ligand-dependent transcriptional activity of the M(421)G selected mutant in the lacZ 
reporter yeast strain Y187. Dose-response curves of P-Galactosidase activity in the presence of increasing 
concentrations of the two fluorenone compounds CMP4 (Fig. 5B), CMP5 (Fig. 5 A), or E2 (Fig. 5Q were 
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determined for the L(384)M, M(421)G selected mutant (squares) and for the L(384)M parental clone 
(circles). EC 5 o values were calculated as described in Material and Methods. 

Figures 6 A and 6B. Mutagenesis of the ER amino acid residues making contacts with the D-ring of 
5 estradiol. The dose-response curves of P-Galactosidase activity in the presence of increasing 

concentrations of estradiol (Fig. 6A) or CMP4 (Fig. 6B) were determined for the triple mutated L(384)M, 
M(421)G, H(524)V chimera (filled triangles) and for the double mutated L(384)M, M(421)G selected 
chimera (filled squares). EC 50 values were calculated as described in Material and Methods. 

10 Figures 7A and 7B. The D(35 1)A mutation reduce the ligand-independent activity of transcription factor 
carrying the hERaLBDL(384)M,M(421)G,H(524)V. Figure 7A. In yeast. Dose-response of (3- 
Galactosidase activity in the presence of increasing concentrations of CMP4. Figure 7B. In HeLa cells. 
Co-transfections of the indicated amounts of GAL4DBD/hERa-LBDA^P16AD and GAL4DBD/hERcc- 
D(351)A,L(384)M,M(421)G,H(524)V-LBDA^P16AD expression vector DNA with 1 (ig of 5GAL4UAS- 

15 pSEAP reporter plasmid DNA were performed as described in Material and Methods. 24 hours post- 

transfection the outlined ligands were added at a concentration of 1 pM and after additional 24 hours the 
supernatants were collected for SEAP enzymatic assay. 

Figures 8A-8C. Transcriptional induction of the M(421)G selected mutant containing the additional 
20 G(521)R mutation in response to antagonistic fluorenone compounds in yeast. Dose-response curves of fJ- 
Galactosidase activity in the presence of increasing concentrations of the two fluorenone compounds 
CMP6 (Fig. 8B) or CMP7 (Fig. 8C) in comparison with 4-OH Tarn (Fig. 8A) were determined for the 
L(384)M, M(421)G, G(521)R mutant (filled squares), for the L(384)M, G(521)R parental clone (circles) 
and for wt alpha (grey triangles). EC50 values were calculated as described in Material and Methods. 

25 

Figure 9A. A representative dose-response curve chimeric transcription factor HEA-2 with CMP8 
compound. Figure 9B. The responses of chimeric transcription factor HEA-2 to various compounds. 

DETAILED DESCRIPTION OF THE INVENTION 

30 

As used herein, "nuclear hormone receptor superfamily" refers to the superfamily of 
nuclear hormone receptors, whose primary sequence suggests that they are related to each other. 
Representative examples include receptors for the estrogen, progesterone, glucocorticoid, 
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mineralocorticoid, androgen, thyroid hormone, retinoic acid, retinoid X, Vitamin D, COUP-TF, ecdysone, 
Nurr-1, and orphan receptors. 

The nuclear hormone receptors are composed of an activation domain, a DNA-binding domain 
and a ligand-binding domain. The DNA-binding domain recognizes and binds to specific regulatory 
5 DNA sequence elements and the ligand-binding domain binds the specific biological compound (ligand) 
to activate the receptor. 

As used herein, "ligand" refers to any compound that activates or represses the receptor, by 
interaction with (binding) the ligand-binding domain of the receptor. 

As used herein, "hormone" refers the natural ligand of the nuclear hormone receptor. 
10 As used herein, "agonist" is a compound that is capable of interacting with the nuclear hormone 

receptor to promote a transcriptional response. For example, estrogen is an agonist for the estrogen 
receptor. Compounds that mimic estrogen would be defined as nuclear hormone receptor agonists. 

As used herein, "antagonist" is a compound that is capable of interacting with a nuclear hormone 
receptor and of blocking the activity of a receptor agonist 
15 As used herein, "inactive analogues" refer to compounds that are structurally related to the 

ligands of a selected nuclear hormone receptor, but do not bind to the ligand-binding domain of the 
receptor. Inactive analogues are normally not found in animals or humans. 

As used herein, "genetic material" refers to contiguous fragments of DNA or RNA. The genetic 
material which is introduced into targeted cells according to the methods described herein can be any 
20 DNA or RNA. For example, the nucleic acid can be: (1) normally found in the targeted cells, (2) 

normally found in targeted cells but not expressed at physiologically appropriate levels in targeted cells, 

(3) normally found in targeted cells but not expressed at optimal levels in certain pathological conditions, 

(4) novel fragments of genes normally expressed or not expressed in targeted cells, (5) synthetic 
modifications of genes expressed or not expressed within targeted cells, (6) any other DNA which may be 

25 modified for expression in targeted cells and (7) any combination of the above. 

As used herein, "nucleic acid cassette" refers to the genetic material of interest which can express 
a protein, or a peptide, or RNA after it is incorporated transiently, permanently or episomally into a cell. 
The nucleic acid cassette is positionally and sequentially oriented in a vector with other necessary 
elements such that the nucleic acid in the cassette can be transcribed and, when necessary, translated in 

30 the cells. 

As used herein, "veneered ligand-binding domain" or "mutant variant" refers to a ligand-binding 
domain with such an alternation or alternations of the primary sequence of the ligand-binding domain 
(LBD) of a receptor such that it differs from the wild type or naturally occurring sequence. The alteration 
can be point mutation, insertion, or deletion. 
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The terms "chimeric" and "chimera" refers to fusion proteins and transcription factors, activators 
and repressors of the invention, to denote composition of components of different origin, in particular of 
different parent proteins. This is irrespective of any inter-species chimericity, and indeed, in preferred 
embodiments a chimeric transcription factor of the invention is composed only of human protein 
5 components. 

As used herein, "plasmid" refers to a construction comprised of extrachromosomal genetic 
material, usually of a circular duplex of DNA that can replicate independently of chromosomal DNA. 
Plasmids are used in gene transfer as vectors. 

As used herein, "vector" refers to a construction comprised of genetic material designed to direct 
10 transformation of a targeted cell. A vector contains multiple genetic elements positionally and 

sequentially oriented with other necessary elements such that the nucleic acid in a nucleic acid cassette 
can be transcribed and when necessary translated in the transfected cells. In the present invention the 
preferred vector comprises the following elements linked sequentially at appropriate distance for allowing 
functional expression: a promoter; a 5 f mRNA leader sequence; an initiation site; a nucleic acid cassette 
15 containing the sequence to be expressed; a 3 1 untranslated region; and a polyadenylation signal. 

As used herein the term "expression vector" refers to a DNA plasmid that contains all of the 
information necessary to produce a recombinant protein in a heterologous cell. 

As used herein, "transformed" refers to transient, stable or persistent changes in the 
characteristics (expressed phenotype) of a cell by the mechanism of gene transfer. Genetic material is 
20 introduced into a cell in a form where it expresses a specific gene product or alters the expression or 
effect of endogenous gene products. One skilled in the art readily recognizes that the nucleic acid 
cassette can be introduced into the cells by a variety of procedures, including transfection and 
transduction. 

As used herein, "transfection" refers to the process of introducing a DNA expression vector into a 
25 cell. Various methods of transfection are possible including microinjection, CaP0 4 precipitation, 
liposome fusion (e.g. lipofection) or use of a gene gun. 

As used herein, "transduction" refers to the process of introducing recombinant virus into a cell 
by infecting the cell with a virus particle. 

As used herein, "transient" relates to the introduction of genetic material into a cell to express 
30 specific proteins, peptides, or RNA, etc. The introduced genetic material is not integrated into the host 
cell genome or replicated and is accordingly eliminated from the cell over a period of time. 

As used herein, "stable" refers to the introduction of genetic material into the chromosome of the 
targeted cell where it integrates and becomes a permanent component of the genetic material in that cell. 
Gene expression after stable transduction can permanently alter the characteristics of the cell leading to 
35 stable transformation. 
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As used herein, "persistent" refers to the introduction of genes into the cell together with genetic 
elements that enable episomal (extrachromosomal) replication. This can lead to apparently stable 
transformation of the characteristics of the cell without the integration of the novel genetic material into 
the chromosome of the host cell. 
5 As used herein, "transcriptional activity" is a relative measure of the degree of RNA polymerase 

activity at a particular promotor. 

As used herein, the abbreviations of amino acid residues are shown as follows: 





Alanine 


Ala 


A 




Arginine 


Arg 


R 


10 


Asparagine 


Asn 


N 




Aspartate 


Asp 


D 




Cysteine 


Cys 


C 




Histidine 


His 


H 




Isoleucine 


He 


I 


15 


frl 1 1 1 ami n p 
v_i x u ta.il ii nw 


Gin 






Glutamate 


Glu 


E 




Glycine 


Gly 


G 




Leucine 


Leu 


L 




Lysine 


Lys 


K 


20 


Methionine 


Met 


M 




Phenylalanine 


Phe 


F 




Proline 


Pro 


P 




Serine 


Ser 


S 




Threonine 


Thr 


T 


25 


Tryptophan 


Tip 


W 




Tyrosine 


Tyr 


Y 




Valine 


Val 


V 



L A Strategy to Construct An Orthogonal Gene Switch System 
30 Nuclear hormone receptors, as well as their engineered derivatives, have been used as gene 

switches. The major problem for the gene switches is the mutual interference between the gene switches 
and the endogenous gene regulation system. On the one hand, the administered exogenous ligands can 
bind to the endogenous nuclear hormone receptors and activate or repress their activities. On the other 
hand, the expressed exogenous nuclear hormone receptor can bind endogenous hormone and respond to 
35 internal stimuli. 
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Hence, in order for a gene switch to be free of mutual interference, the gene switch should 
include a novel transcription factor and an exogenous ligand with the following characteristics. The 
transcription factor binds the exogenous ligand, but not any endogenous hormones, while the exogenous 
ligand does not bind to any endogenous nuclear hormone receptors. The binding of the exogenous ligand 
5 activates or represses the novel transcription factor. In other words, the exogenous ligand is the only 
agonist or antagonist of the novel transcription factor. Such a gene switch free of mutual interference is 
also known as orthogonal gene switch. 

As used herein, "do not bind" or "incapable of binding" refers to no detectable binding, or an 
insignificant binding, i.e., having a binding affinity much lower than that of the natural ligand. The 
10 affinity can be determined with competitive binding experiments that measure the binding of a receptor 
with a single concentration of labeled ligand in the presence of various concentrations of unlabeled 
ligand. Typically, the concentration of unlabeled ligand varies over at least six orders of magnitude. 
Through competitive binding experiments, IC 50 can be determined. As used herein, "IC 50 " refers to the 
concentration of the unlabeled ligand that is required for 50% inhibition of the association between 
15 receptor and the labeled ligand. IC 50 is an indicator of the ligand-receptor binding affinity. Low IC50 
represents high affinity, while high IC 5 o represents low affinity. 

According to an embodiment of the present application, an orthogonal gene switch is constructed 
by developing inactive analogues of a known nuclear hormone as the exogenous ligand, and veneering 
the ligand-binding domain of the nuclear hormone receptor for the orthogonal transcription factor. The 
20 inactive analogues are incapable of binding to the receptor. The veneered ligand-binding domain is 
capable of binding the inactive analogue, but not its naturally occurring counterpart 

According to a preferred embodiment of the present application, a nuclear hormone receptor is 
selected to provide its ligand-binding domain (LBD), which is veneered for the orthogonal transcription 
factor. The structures of the LBD and its ligands and their interactions with are preferrably well 
25 understood. 

Based on the understanding of the structures and interactions, inactive analogues of the ligands 
are synthesized and selected. The inactive analogues are compounds that are structurally related to the 
ligands of a selected nuclear hormone receptor, but do not bind to the ligand-binding domain of the 
receptor. Inactive analogues are normally not found in animals or humans. 

30 The ligand-binding domain (LBD) of the receptor is then veneered to bind the selected inactive 

analogue and be activated by the binding. The LBD is also veneered to diminish its capability of binding 
the nuclear hormone and the ligand-independent activity of the transcription factor carrying it. 

According to a preferred embodiment of the present invention, a library of transcription factors 
with mutant variants of the LBD is constructed. The mutant variants are made by mutating amino acid 

35 residues that might hinder the binding of the inactive analogue into various amino acid residues that 
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might facilitate the binding. The library is then screened with the selected inactive analogue to select the 
transcription factors that can be activated by the inactive analogue. The selected transcription factors 
contain veneered LBDs that are capable of binding the inactive analogue. Further mutations are then 
introduced into the LBDs to reduce its binding of the hormone, and the ligand-independent transcription 
activity. 

The veneered LBDs can then be used to contract appropriate transcription factors for the 
orthogonal gene switch system, while the selected inactive analogue can be used as exogenous ligand for 
the system. A new series of exogenous ligands for the system can also be obtained by developing 
inactive analogues that fit into the structure of the veneered LBD. 

According to an embodiment of the present application, the ligand-binding domain (LBD) is from 
a nuclear receptor selected from the group consisting of estrogen receptor (ER), androgen receptor (AR), 
glucocorticoid receptor (GR), mineralocorticoid receptor (MR), progesterone receptor (PR), vitamin D 3 
receptor (VDR), thyroid hormone receptor (TR), and retinoic acid receptor (RAR). The structure of the 
LBD and the interaction with its ligands is preferably well understood to provide guidance for veneering 
the LBD. 

According to a preferred embodiment of the present application, the ligand-binding domain 
(LBD) is from human estrogen receptor a (hERa). The amino acid sequence of hERa LBD is provided 
bySEQIDNO: 1: 

SAGDMRAANL WPSPLMIKRS KKNSLALSLT ADQMVSALLD AEPPILYSEY 
DPTRPFSEAS MMGLLTNLAD RELVHMINWA KRVPGFVDLT LHDQVHLLEC 
AWLEILMIGL VWRSMEHPGK LLFAPNLLLD RNQGKCVEGM VEIFDMLLAT 
SSRFRMMNLQ GEEFVCLKSI ILLNSGVYTF LSSTLKSLEE KDHIHRVLDK 
ITDTLIHLMA KAGLTLQQQH QRLAQLLLIL SHIRHMSNKG MEHLYSMKCK 
NWPLYDLLL EMLDAHRLHA PTSRGGASVE ETDQSHLATA GSTSSHSLQK 
YYITGEAEGF PATV 

The ligand-binding domain (LBD) is located located in the carboxy-tenninal half of human 
estrogen receptor oc(hERa), from amino acid residue 282 to amino acid residue 595 of the receptor. As 
used herein, amino acid residues or point mutations of hERa LBD are numbered according to their 
correponding positions in the full-length hERot. 

According to a further preferred embodiment of the present application, the hERa LBD carries a 
Leu(384)Met mutation. The L384M mutation is both necessary and sufficient to make hERa LBD have 
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the binding specificities of hER(3 LBD. The amino acid sequence of hERa LBD carrying the L384M 
mutation is provided by SEQ ID NO: 2: 

SAGDMRAANL WPSPLMIKRS KKNSLALSLT ADQMVSALLD AEPPILYSEY 
5 DPTRPFSEAS MMGLLTNLAD RELVHMINWA KRVPGFVDLT LHDQVHLLEC 
AWMEILMIGL VWRSMEHPGK LLFAPNLLLD RNQGKCVEGM VEIFDMLLAT 
SSRFRMMNLQ GEEFVC LKS I ILLNSGVYTF LSSTLKSLEE KDHIHRVLDK 
ITDTLIHLMA KAGLTLQQQH QRLAQLLLIL SHIRHMSNKG MEHLYSMKCK 
NWPLYDLLL EMLDAHRLHA PTSRGGASVE ETDQSHLATA GSTSSHSLQK 
10 YYITGEAEGF PATV 

H The Structures of the Exogenous Ligands and the Methods of Synthesizing Them 
As discussed above, after a nuclear hormone receptor is selected to provide its ligand-binding 
domain (LBD), inactive analogues of the nuclear hormone are synthesized and selected to direct the 
15 veneering of the LBD for the othogonal gene switch system. New inactive analogues can also be 

synthesized to fit the structure of the veneered LBD. According to a preferred embodiment of the present 
application, an estrogen receptor is selected to provide its LBD, and the inactive analogues of estrogen are 
synthesized and selected. 

According to a preferred embodiment of the present application, the inactive analogues are the 
20 compounds described by the following chemical formulae: 




or 
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or 




wherein X is selected from the group consisting of: O, N-OR a N-NR a R b and Ci_6 alkylidene, wherein 

said alkylidene group is unsubstituted or substituted with a group selected from hydroxy, 
amino, OCCj^alkyl), NHCC^alkyl), orNCCj^alkyl)^ 

R 1 is selected from the group consisting of hydrogen, OR b , NR b R c , fluoro, chloro, bromo, iodo, cyano, 
10 and nitro; 

R 2 is selected from the group consisting of Cj.ioalkyl, C2-ioalkenyl, C2_ioalkynyl, cycloalkylalkyl, 

arylalkyl and heteroarylalkyl, wherein said alkyl, alkenyl, alkynyl, cycloalkylalkyl, 
arylalkyl and heteroarylalkyl groups can be optionally substituted with a group selected 
from OR b , SR b , C(=0)R b 1-5 fluoro, chloro, iodo, cyano; 

15 R3 is selected from the group consisting of hydrogen, chloro, bromo, iodo, cyano, NR a R c , OR a , 

C(=0)R a C0 2 R c , CONR a R c , C 1 . 10 alkyl v C 2 _ 10 alkenyl, C 2 _ 10 alkynyl, C 3 . 7 cycloalkyl, 
cycloalkylalkyl, aryl, heteroaryl, arylalkyl, and heteroarylalkyl, wherein said alkyl, 
alkenyl, alkynyl, cycloalkyl, aryl and heteroaryl groups are either unsubstituted or 
independendy substituted with 1, 2 or 3 groups selected from fluoro, chloro, bromo, iodo, 

20 cyano, OR a , NR a R c , 0(C=0)R a , 0(C=0)NR a R c , NR a (C=0)R c , NR a (C=0)OR c , 
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C(=0)Ra C0 2 R a , CONR a R c , CSNR a R c , SR a , S(0)R a S0 2 R a S0 2 NR a R^ YR d , and 
ZYR d ; 

R 4 is selected from the group consisting of OR b , OR a , 0(C=0)R c , 0(C=0)OR c , and NH(C=0)R c ; 
R a is selected from the group consisting of hydrogen, C^galkyl, and phenyl, wherein said alkyl group 
5 can be optionally substituted with a group selected from hydroxy, amino, O(Ci^alkyl), 

NHCC^alkyl), N(C 1 ^alkyl) 2> phenyl, or 1-5 fluoro; 
R b is selected from the group consisting of hydrogen, C^alkyl, and phenyl; 
R c is selected from the group consisting of hydrogen and Cj^alkyl; 

or R a and R c , whether or not on the same atom, can be taken together with any attached 
10 and intervening atoms to form a 4-6 membered ring; 

R d is selected from the group consisting of NR b R c , OR a , C0 2 R a , 0(C=0)R a NR c (C=0)R b , CONR a R c , 
S02NR a R c , and a 4-7 membered N-heterocycloalkyl ring that can be optionally 

interrupted by O, S, NR C , or C=0; 
Y is selected from the group consisting of CR b R c , C 2 _6 alkylene and C 2 -6 alkenylene, wherein said 

15 alkylene and alkenylene linkers can be optionally interrupted by O, S, or NR C ; 

Z is selected from the group consisting of O, S, NR C , C=0, 0(C=0), (C=0)0, NR c (C=0) or (C=0)NR c . 

In the confounds of the present invention, X is preferably selected from the group consisting of 
O and N-OR a More preferably, X is selected from the group consisting of O, N-OH and N-OCH3. 

20 In the compounds of the present invention, Rl is preferably selected from the group consisting of 

hydrogen, NR b R c , fluoro, chloro, bromo, nitro and Chalky!. 

In the compounds of the present invention, R 2 is preferably selected from the group consisting of 
Cl-ioalkyl, C^joalk^yl C3^cycloalkyl, cycloalkylalkyl, arylalkyl and heteroarylalkyl, wherein said 

alkyl, alkenyl, cycloalkyl and cycloalkylalkyl, arylalkyl and heteroarylalkyl, groups can be optionally 
25 substituted with a group selected from OR b SR b C(=0)R b , or 1-5 fluoro, chloro, iodo, cyano. 

In the compounds of the present invention, R 2 is more preferably selected from the group 
consisting of arylalkyl and heteroarylalkyl, wherein said arylalkyl and heteroarylalkyl, groups can be 
optionally substituted with a group selected from OR b , SR b , C(=0)R b , or 1-5 fluoro, chloro, iodo, cyano. 

In the compounds of the present invention, R 3 is preferably selected from the group consisting of 

30 hydrogen, chloro, bromo, iodo, cyano, OR a , C(=0)R a , Ci_ioalkyl, C 2 _ioalkenyl, cycloalkylalkyl, aryl, 
heteroaryl, arylalkyl, and heteroarylalkyl, wherein said alkyl, alkenyl, aryl and heteroaryl groups are 
either unsubstituted or independently substituted with 1, 2 or 3 groups selected from fluoro, chloro, 
bromo, iodo, cyano, OR a , NR a R c , 0(C=0)NR a R c , C(=0)R a , CO^R 0 , CONR a R c , CSNR a R c , SR a YR d , 
and ZYR d 
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In the compounds of the present invention, R 3 is more preferably selected from the group 
consisting of hydrogen, chloro, bromo, iodo, cyano, Ci_ioalkyl, aryl, heteroaryl, wherein said alkyl, aryl 

and heteroaryl groups are either unsubstituted or independently substituted with 1, 2 or 3 groups selected 
from fluoro, chloro, bromo, cyano, NR a C(=0)R c , NR a R c , 0(C=0)NR a R c , C(=0)R a , C02R C , 
5 CONR a R c , SR a YR d , and ZYR d . 

In the compounds of the present invention, R 4 is preferably selected from the group consisting of 
hydrogen, 0(C=0)R c , OR a and 0(C=0)OR c . 

In the compounds of the present invention, R 4 is more preferably OH. 

The compounds of the present invention can have chiral centers and occur as racemates, racemic 
10 mixtures, diastereomeric mixtures, and as individual diastereomers, or enantiomers with all isomeric 
forms being included in the present invention. Therefore, where a compound is chiral, the separate 
enantiomers, substantially free of the other, are included within the scope of the invention; further 
included are all mixtures of the two enantiomers. 

The term "aikyl" shall mean a substituting univalent group derived by conceptual removal of one 
15 hydrogen atom from a straight or branched-chain acyclic saturated hydrocarbon (i.e., -CH3, -CH2CH3, 
-CH 2 CH 2 CH 3 , -CH(CH 3 ) 2 , -CH2CH2CH2CH3, -CH 2 CH(CH 3 ) 2 , -C(CH 3 ) 3j etc.). 

The term "alkenyl" shall mean a substituting univalent group derived by conceptual removal of 
one hydrogen atom from a straight or branched-chain acyclic unsaturated hydrocarbon containing at least 
one double bond (i.e., -CH=CH 2 , -CH 2 CH=CH 2 , -CH=CHCH 3 , -CH 2 CH=C(CH 3 ) 2 , etc.). 

20 The term "alkynyl" shall mean a substituting univalent group derived by conceptual removal of 

one hydrogen atom from a straight or branched-chain acyclic unsaturated hydrocarbon containing at least 
one triple bond (i.e., -OCH, -CH 2 OH, -OCCH3, -CH 2 OCCH 2 (CH 3 ) 2 , etc.). 

The term "alkylene" shall mean a substituting bivalent group derived from a straight or branched- 
chain acyclic saturated hydrocarbon by conceptual removal of two hydrogen atoms from different carbon 
25 atoms (i.e., -CH2CH2-, -CH2CH2CH2CH2-, <M 2 C(CH 3 ) 2 CH 2 -, etc.). 

The term "alkylidene" shall mean a substituting bivalent group derived from a straight or 
branched-chain acyclic saturated hydrocarbon by conceptual removal of two hydrogen atoms from the 
same carbon atom (i.e., =CH 2 , =CHCH 3 , =C(CH3) 2 , etc.). 

The term "alkenylene" shall mean a substituting bivalent group derived from a straight or 
30 branched-chain acyclic unsaturated hydrocarbon by conceptual removal of two hydrogen atoms from 
different carbon atoms (i.e., -CH=CH-, -CH 2 CH=CH-, CH 2 CH=CHCH 2 -, -C(CH 3 )=C(CH 3 )-, etc.). 

The term "cycloalkyl" shall mean a substituting univalent group derived by conceptual removal 
of one hydrogen atom from a saturated monocyclic hydrocarbon (i.e., cyclopropyl, cyclobutyl, 
cyclopentyl, cyclohexyl, or cycloheptyl). 
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The term "cycloalkenyl" shall mean a substituting univalent group derived by conceptual removal 
of one hydrogen atom from an unsaturated monocyclic hydrocarbon containing a double bond (i.e., 
cyclopentenyl or cyclohexenyl). 

The term "heterocycloalkyl" shall mean a substituting univalent group derived by conceptual 
5 removal of one hydrogen atom from a heterocycloalkane wherein said heterocycloalkane is derived from 
the corresponding saturated monocyclic hydrocarbon by replacing one or two carbon atoms with atoms 
selected from N, O or S. Examples of heterocycloalkyl groups include, but are not limited to, oxiranyl, 
azetidinyl, pyrrolidinyl, piperidinyl, piperazinyl, and morpholinyl. 

The term "aryl" as used herein refers to a substituting univalent group derived by conceptual 
10 removal of one hydrogen atom from a monocyclic or bicyclic aromatic hydrocarbon. Examples of aryl 
groups are phenyl, indenyl, and naphthyl. 

The term "heteroaryl" as used herein refers to a substituting univalent group derived by the 
conceptual removal of one hydrogen atom from a monocyclic or bicyclic aromatic ring system containing 
1, 2, 3, or 4 heteroatoms selected from N, O, or S. Examples of heteroaryl groups include, but are not 
15 limited to, pyrrolyl, furyl, thienyl, imidazolyl, pyrazolyl, oxazolyl, isoxazolyl, thiazolyl, pyridyl, 
pyrimidinyl, pyrazinyl, benzimidazolyl, indolyl, and purinyl. 

The term "cycloheteroalkyl," as used herein, shall mean a 3- to 8-membered fully saturated 
heterocyclic ring containing one or two heteroatoms chosen from N, O or S. Examples of 
cycloheteroalkyl groups include, but are not limited to piperidinyl, pyrrolidinyl, azetidinyl, morpholinyl, 
20 piperazinyl. 

The term "alkoxy," as used herein, refers to straight or branched chain alkoxides of the number of 
carbon atoms specified (e.g., Ci-5 alkoxy), or any number within this range (i.e., methoxy, ethoxy, etc.). 

Whenever the term "alkyl" or "aryl" or either of their prefix roots appear in a name of a 

substituent (e.g., aryl Ci-8 alkyl) it shall be interpreted as including those limitations given above for 
25 "alkyl" and "aryl." Designated numbers of carbon atoms (e.g., Ci-io) shall refer independently to the 

number of carbon atoms in an alkyl or cyclic alkyl moiety or to the alkyl portion of a larger substituent in 

which alkyl appears as its prefix root 

The terms "arylalkyl" and "alkylaryl" include an alkyl portion where alkyl is as defined above 

and to include an aryl portion where aryl is as defined above. Examples of arylalkyl include, but are not 
30 limited to, benzyl, fluorobenzyl, chlorobenzyl, phenylethyl, phenylpropyl, fluorophenylethyl, 

chlorophenylethyl, thienylmethyl, thienylethyl, and thienylpropyl. Examples of alkylaryl include, but are 

not limited to, toluene, ethylbenzene, propylbenzene, methylpyridine, ethylpyridine, propylpyridine and 

butylpyridine. 

The term "heteroarylalkyl," as used herein, shall refer to a system that includes an arylalkyl 
35 portion, where arylalkyl is as defined above, and contains one or two heteroatoms chosen from N, O or S. 
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The term "cycloarylalkyl," as used herein, shall refer to a system that includes a 3- to 8- 
membered fully saturated cyclic ring portion and also includes an arylalkyl portion, where arylalkyl is as 
defined above. 

In the compounds of the present invention, Rl and R2, when on the same carbon atom, can be 
5 taken together with the carbon atom to which they are attached to form a 3-6 membered ring. 

In the compounds of the present invention, R a and Rb can be taken together with any of the 
atoms to which they may be attached or are between them to form a 4-6 membered ring system. 

The novel compounds of the present invention can be prepared according to the procedures of the 
following schemes and examples, using appropriate materials, and are further exemplified by the 
10 following specific examples. The compounds illustrated in the examples are not, however, to be 

construed as forming the only genus that is considered as the invention. The following examples further 
illustrate details for the preparation of the compounds of the present invention. Those skilled in the art 
will readily understand that known variations of the conditions and processes of the following preparative 
procedures can be used to prepare these compounds. 
15 The compounds of the present invention can be prepared according to the general methods 

outlined in Schemes I-VL CH 2 R^ represents non-hydrogen values of R^, or precursors thereof; R 777 
represents R3 or a precursor thereof; K^ a and R#J& represent non-hydrogen values of R^, or precursors 
thereof. R^ represents OR a and NR a R^. represents an acyl group such as acetyl or the like; and R^ 
represents a JV-protecting group for an indole, indazole, benzimidazole, or benzotriazole group. 

20 

SCHEME I 

The fundamental methods for construction of 9a-substituted l,2,9,9a-terti^ydrcH3#-fluoren-3- 
one compounds are illustrated in Scheme I, and are based on chemistry described by Cragoe, et aL 9 /. 
Med. Cherru 1986, 29, 825-841. The indanone starting material (la) of Scheme I is commercially 

25 avaiable. A bromo- (chloro) substituent as R 1 can be introduced by reaction of the unsubstituted 

indonanone (1, R 1 = H) with NBS (NCS). The 2-alky lidene- 1 -indanones (lb) are prepared by reacting 2- 
unsubstituted indanones (la) with aldehydes under basic conditions. Reduction of the double bond (step 
2) affords the indanone (lc). In step 3 of Scheme I, a 2-substituted-l -indanone (lc) is reacted with a 
vinyl ketone in the presence of base. The crude product is then cyclized (step 3) under basic or acidic 

30 conditions. After O-deprotection the tetrahydrofluorenone products (le) are obtained. 
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Representative reagents and reaction conditions indicated in Scheme I as steps 1-4 are as follows: 
Step 1 Rll CHO, EtOH, KOH, rt 

Step 2 H 2 , 10% Pd/C, EtOAc, rt 

Step 3 CH 2 =C H C(0)CH2R /// , DBN, THF, rt to60°C or 

CH2=CHC(0)CH 2 R //Z , NaOMe, MeOH, rt to60°C, 
then pyrrolidine, HOAc, THF or PhMe, 60-85°C or 
HOAc/6 N HC1, 80°C 

Step 4 BBr 3 , CH2CL2, -78°C to rt 
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SCHEME H 

Tetxahydrofluorenones of types (le) wherein is hydrogen (2a) can be functionalized at the 4- 
position by the methods illustrated in Scheme II. Bromination (step 1) affords the 4-bromo intermediates 
(2b). These compounds can be converted (step 2) by known methods into a variety of new derivatives 
5 (2c) wherein R IIIb is aryl, heteroaryl. If the group R IIIb is or contains a functional group capable of 
further modification, this can be carried out to produce additional derivatives. For example, a R^> 4.. 
hydroxy-phenyl group can be alkylated at the oxygen. 




(2a, R ; "=H) (2b) R' //a =Br 



l step2 




(2c) 

10 

Representative reagents and reaction conditions indicated in Scheme II as steps 1 and 2 are as 

follows: 

Step 1 Br 2 , NaHC0 3 , CH 2 C1 2 or CCI4, 0°C to rt Rflfa = Br 

15 

Step 2 R## SnBu 3 , Pd(PPh 3 ) 4 , PhMe, 100°C or RMb = aryl, or 

heteroaryl 
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SCHEME in 

Modifications to the C-3 ketone are outlined in Scheme m for the tetrahydrofluorenone 
derivative (3a). The methodology also applies to the other tetrahydrofluorenone products prepared 
5 according to Schemes I-EL In step 1, the ketone is reacted with a hydroxylamine or alkoxylamine reagent 
to yield the 3-imino product (3b). 




10 Representative reagents and reaction conditions indicated in Scheme IE as step 1 are as follows: 

Step 1 NH 2 OR a HCl, pyridine, EtOH rt = OR a 

SCHEME IV 

15 The principal method for constructing the tetracyclic 8,9,9a,10-tetrahydroindeno[2,l-e]indazol- 

7(3i/)-one compounds of the present invention is summarized in Scheme IV. Bxomination (step 1) of 4- 
unsubstituted-5-(acylamino)-l-mdanones (4a) provides the 4-bromo compounds (4b) which can be 
converted (step 2) into the 4-methyl derivatives (4c) using Stille methodology. The 2-unsubstituted 
intermediates (4c) are converted to the corresponding 2-substituted compounds (4e) by an aldol 

20 condsation (step 3) and subsequent reduction (step 4). 4-Methyl-l-indanones (4c) reacts with vinyl 

ketones under basic conditions to provide diketones, which are then cyclized and deacylated under acidic 
or basic conditions (step 5) to afford the 7-amino-8-methyl-tetrahydrofluorenone intermediate (4f). If the 
cyclization is accomplished using pyrrolidine and acetic acid the amino group remains protected and 
allows a further elaboration before the formation of the fused pyrazole ring. Formation of the fused 

25 pyrazole ring is accomplished by treating (4f) with a diazotizing reagent followed by cyclization of the 
diazo intermediate with KOAc and dibenzo- 1 8-cro wn-6 (step 6). 
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Representative reagents and reaction conditions indicated in Scheme IV as steps 1-6 are as 



follows: 



Stepl NBS,MeCNorDMF,60°C 



Step 2 Me 4 Sn, PdCl2(PPh 3 ) 2 , PPI13, LiCl, DMF, 100°C 



10 



Step 3 R^CHO, MeOH, NaOMe, rt 



Step 4 



H 2 , 10% Pd/C, EtOAc, rt 
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Step 5 CH 2 =CHC(0)CH 2 R //z , DBN, THF, rt to60°C or 

CH 2 =CHC(0)CH 2 r/// > NaOMe, MeOH, rt to60°C, 
then pyrrolidine, HO Ac, THF or PhMe, 60-85°C or 
HOAc,6NHCl, 100°C 

5 

Step 6 i) NOBF 4 , CH 2 C1 2 , -45°C to 10°C 

ii) KOAc, dibenzo-18-crown-6, CH 2 C1 2 , -40°C to rt 

SCHEME V 

10 Scheme V shows a method of synthesis of tetrahydroindeno[2,l-e]indazol-7(3i/)-one compounds 

in which the R/tf substituent is introduced onto a preformed tricyclic ring system. The 4-unsubstituted 
tetrahydrofluorenone intermediate (5a), which itself is prepared by cyclization (see Scheme IV, step 5) of 
intermediate (4c) wherein R/^ is hydrogen, undergoes bromination (step 1) to afford the 4-bromo 
intermediates (5b). Deacylation (step 2), followed by pyrazole ring formation (step 3) affords the 6- 

15 bromo-tetrahydroindeno[2,l-e]indazol-7(3/0-one products (5d). The pyrazole group is ^-protected (step 
4) to give a mixture of 2- and 3-substituted derivatives (5el) and (5e2) which can be used as such or 
which can be separated and used independently. The Af-protected intermediates are converted by 
established methods (step 5) into a variety of new derivatives (5f) wherein tJHb is, inter alia, an alkyl, 
alkenyl, alkynyl, aryl, heteroaryl or arylalkyl group. Removal of the TV-protection (step 6) affords the 

20 products (5g). If the group R"^ is, or contains, a functional group capable of further modification, such 
modifications can be carried out to produce additional derivatives. For example a 4-hydroxy-phenyl 
group can be alkylated at the oxygen. 



25 
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(5a) (5b) (5c) 

| step 3 




(5f) (5g) 
Representative reagents and reaction conditions indicated in Scheme V as steps 1-6 are as 

follows: 

Step 1 Br2, NaHC0 3 , CH2CI2 or CCI4, 0°C to rt R IIIa = Br 

Step 2 NaOMe, MeOH and/or EtOH, rt to 80°C R°= acetyl 

Step 3 i) NOBF4, CH 2 Cl2. "45°C to 10°C 

ii) KOAc, dibenzo-18-crown-6, CH2CI2, -40°C to rt 

Step 4 TsCl, DMAP, CH 2 a 2 , 0°C to rt = Ts 

Step 5 R^SnBu 3 , Pd(PPh 3 ) 4 , PhMe, 100°C R IIIb = alkenyl, 

aryl, or heteroaryl 
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Step 6 NaOH, 1 ,4-dioxane-EtOH, rt R p = Ts 

SCHEME VI 

5 The principal method for constructing the 8,9,9a, 10-tetrahydro-fluorenof 1 ,2-d] [ 1 ,2,3]triazol- 

7(3H) -one compounds of the present invention is summarized in Scheme VI. After amino-protection of 
5-ammo-indane (stepl) bromination of unsubstituted 5-(acylamino)-l-indane (6b) (step 2) provides the 6- 
bromo compound (7c) which is oxidized (step 3) to give the indanone-derivative (6d). After nitration and 
deprotection of the 5-amino group a catalytic reduction (step 6) leads to the diamino-intermediate (6g), 

10 which is subjected to cyclization (step 7) to form 7,8-dihydroindeno[4,5-rf][l,2,3]triazol-6(3^0-one (6h). 
The 2-unsubstituted intermediate (6h) is converted to the corresponding 2-substituted compounds (6j) by 
an aldol condsation (step 8) and subsequent reduction (step 9). The 2-substituted 7,8-dihydroindeno[4,5- 
d][l^,3]triazol-6(3/?)-ones (6j) react with a vinyl ketone under basic conditions to provide diketones, 
which are then cyclized under acidic or basic conditions (step 10) to afford the 8,9,9a, 10- 

15 tetrahydrofluoreno-[l,2-d][l,2,3]triazol-7(3 J fiO-one derivatives (6k). 
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XX> -\jCo — "Co 



H (6c) 




H 2 N 



N=N (6j) 



Representative reagents and reaction conditions indicated in Scheme VI as steps 1-10 are as 



follows: 



Step 1 



Ac 2 0 , HO Ac, reflux 



R° = Acetyl 



Step 2 Br 2 , HOAc, 10°C 



10 Step 3 Cr0 3 , HOAC, 15°C 



Step 4 HN0 3 , -40°C 
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Step 5 



MeOH,HCl, reflux 



Acetyl 



Step 6 



Pd(C), H 2 , EtOAc, KOAc 



5 Step 7 



NaN0 2 , HC1 



Step 8 



R /7 CHO, MeOH, NaOMe, rt 



Step 9 



H 2 , 10% Pd/C, EtOAc-EtOH, rt 



10 

Step 10 CH 2 = chc (°) ch 2 r/// ' dbn » thf > rt to60°C or 

CH 2 =CHC(0)CH 2 R 7// , NaOMe, MeOH, rt to60°C, 
then pyrrolidine, HO Ac, THF or PhMe, 60-85°C or 
HOAc,6NHCl, 100°C 



In Schemes I-VI, the various R groups often contain protected functional groups which are 
deblocked by conventional methods. The deblocking procedure can occur at the last step or at an 

4 

intermediate stage in the synthetic sequence. For example, if one of R is a methoxyl group, it can be 
converted to a hydroxyl group by any of a number of methods. These include exposure to BBr3 in 

20 CH 2 C1 2 at -78°C to room temperature, heating with pyridine hydrochloride at 190-200°C, or treatment 

with EtSH and AICI3 in CH 2 C1 2 at 0°C to room temperature. Another example involves the use of 

methoxymethyl (MOM) protection of alcohols and phenols. The MOM group is conveniently removed 
by exposure to hydrochloric acid in aqueous methanol. Other well-known protection-deprotection 
schemes can be used to prevent unwanted reactions of various functional groups contained in the various 
25 R substituents. 

The specific examples I-III, while not limiting, serve to illustrate the methods of preparation of 
the l,2,9,9a-tetrahydro-3H-fluoren-3-one compounds of the present invention. The specific examples IV- 
V, while not limiting, serve to illustrate the methods of preparation of the 8,9,9a, 10-tetrahydroindeno[2,l- 
e]indazol-7(3jtf)-one compounds of the present invention. The specific example VI, while not limiting, 
30 serves to illustrate the methods of preparation of the 8,9,9a,10-tetrahydrofluoreno-[l,2-rfl[l,2,3]triazol- 
7(3#)-one compounds of the present invention. All compounds prepared are racemic, but could be 
resolved if desired using known methodologies. 

Using the methods disclosed above, various inactive analogues of estrogen, as well as active 
analogues of estrogen are synthesized, including but not limited to: CMP1, CMP2, CMP3, CMP4, CMP5 



15 
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(Fig. 1A), CMP6, CMP7, CMP8, CMP9, CMP10 (Fig. IB), and the compounds listed in tables la, lb, 2 
and 3. 

HI. The Methods of Veneering the Ligand-bindinj* Domains (LBDs^ 
5 and Structures and Sequences of the Veneered LBDs 

As discussed above, nuclear hormone receptors are modular proteins organized into structurally 
and functionally defined domains, including activation domain (AD), DNA-binding domain (DBD), and 
ligand-binding domain (LBD). Chimeric nuclear hormone receptors can be constructed by swapping the 
modular domains with their counterparts from other nuclear hormone receptors, or even proteins other 
10 than nuclear hormone receptors. The ligand-binding domain can subject the chimeric transcription factor 
to the control of the ligand binding to the domain. For instance, upon binding estrogen, the chimeric 
transcription factor with GAL4 DBD/ER LBD/VP16 AD is activated, binds to the binding site of GAL4, 
and promotes the transcription of the gene fused to the binding site. 

The ligand-binding specificities of both nuclear hormone receptors and the chimeric trancription 
15 factors are determined by their LBD. The efforts of constructing an orthogonal gene switch system was 
therefore focused on the modification of the LBD. According to an embodiment of the present 
application, the veneered LBD for the chimeric transcription factors can be obtained with selected 
inactive analogues through the following approach. 

First, mutations are introduced into the hydrophobic pocket region that surrounds the ligands to 
20 make the LBD be capable of binding the inactive analogues. The knowledge about the LBD and its 

ligands, and the structural differences between the ligands and the inactive analogues are used to guide 
the mutagenesis, leading to the production of mutant variants of the LBD. A library of transcription 
factors is constructed with the mutant variants fused to a DNA-binding domain (DBD), and an activation 
domain (AD). 

25 Second, the library is screened with an inactive analogue to select the transcription factor that is 

capable of binding the inactive analogue and is activated by binding it. The selected transcription factors 
contain mutant variants of the LBD, which are the LBDs veneered to bind the inactive analogue. 

Third, additional mutations are introduced into the selected veneered LBDs of the selected 
transcription factors, to decrease both their affinity for the natural ligand (hormone) and the ligand- 

30 independent activity of the transcription factors. The LBDs can then be used to construct the transcription 
factor appropriate for orthogonal gene switches, and the inactive analogues can be used as the exogenous 
ligands for the switches. Exogenous ligands for the switches can also be obtained by developing a new 
series of inactive analogues, based on the structures of the veneered LBDs. 
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According to a preferred embodiment of the present invention, the LBD of estrogen receptor (ER) 
was veneered, using the strategies disclosed above. The LBD is more preferrably from human estrogen 
receptor a (hERa). 

According to a preferred embodiment of the present invention, a L384M mutation is introduced 
5 into hERa LBD. The L384M mutation is both necessary and sufficient to make hERa LBD have the 
binding specificities of hERfJ LBD. Thus, the L384M hERa LBD can be veneered to bind the inactive 
analogues of hERfJ ligands, such as CMP1. Five residues within the ligand-binding pocket of L384M 
hERa LBD were identified as the most likely candidate residues interfering with binding of CMP1: 
L391, F404, M421, 1424, and L428. 
10 A library of transcription factors is constructed containing mutant variants of L384M hERa LBD, 

whose L391, F404, M421, 1424, and L428 are independently mutated into one of the following amino 
acid residues: Gly, Ala, Cys, Val, De, Leu, Met, Phe, Tyr and Trp. Each transcription factor in the library 
also contains a DNA-binding domain (DBD) and an activation domain (AD). 

The library is then screened with an inactive analogue of a hER0 ligand, such as CMP1, to select 
15 transcription factors that can be activated by the inactive analogue. The selected transcription factors 

contain veneered LBDs that are capable of binding the inactive analogue. The amino acid sequences of 
the selected LBDs are provided in SEQ ID NO: 3-15. 

According to a preferred embodiment of the present invention, mutations of D351A and H524V 
are introduced into the selected LBDs, as shown in SEQ ID NO: 3-15, to diminish their capability of 
20 binding the- hormone and to reduce the ligand-independent activity of the transcription factor carrying the 
LBD. The amino acid sequences of the veneered hERa LBDs are provided in SEQ ID NO: 16-28. 

The veneered LBDs of SEQ ID NO: 16-28 are capable of binding certain inactive analogues of 
hER|3 ligands, such as CMP1, CMP4, CMP5, and CMP 11-38 (Fig. 1A and tables 1-3), and to be 
activated by the binding, while the LBDs do not bind the natural hER ligand. 
25 Alternatively, a G521R mutation is introduced into the selected LBDs, as shown in SEQ ID NO: 

3-15, to diminish their capability of binding the natural hormone and to reduce the ligand-independent 
activity of the transcription factor carrying the LBD. The amino acid sequences of the veneered hERa 
LBDs are provided in SEQ ID NO: 29-41. 

A new series of inactive analogues of hER0 ligands, including CMP6, CMP7, CMP8, CMP9, and 
30 CMP10 (Fig. IB), are synthesized to fit to the LBDs carrying the G521R mutation. 

The veneered LBDs of SEQ ID NO: 29-41 are capable of binding certain inactive analogues of 
hERP ligands, such as CMP6, CMP7, CMP8, CMP9, and CMP10, and to be activated by the binding, 
while they do not bind the natural hER ligand. 
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The present invention also provides polynucleotides that encode the veneered hERa LBDs as 
shown in SEQ ID NO: 1-41. 

TV. Chimftnr T ranscription Factors Containing the Veneered LBDs 
5 The present invention provides a novel chimeric transcription factor. The chimeric transcription 

factor is activated or repressed by an inactive analogue of the naturally occurring ligand of the nuclear 
hormone receptor, while it does not respond to the naturally occurring ligand itself. 

According to an embodiment of the present invention, the chimeric transcription factor comprises 
the veneered ligand-binding domain (LBD) of a nuclear hormone receptor, a transcriptional regulatory 
10 domain, which can be an activation domain (AD) or a repression domain, and a DNA-binding domain 
(DBD). 

It should be noted that the three essential components of the ligand binding-dependent 
transcripton factors, namely the DNA-binding domain, the ligand-binding domain and the transcriptional 
regulatory domain, may be arranged in any order or sequence in a transactivator/transrepressor fusion 
15 protein of the invention. 

According to a preferred embodiment of the present invention, all the domains of the chimeric 
transcription factor are from, or veneered from, proteins of human origin. 

Gene expression in cells of non-human mammal is often desired. According to an alternative 
embodiment of the present invention, the domains of animal origin rather than human origin can be used. 
20 The invention therefore further pertains to any chimeric transcription factor that comprises the domains of 
mammalian species other than human, including rabbit, guinea pig, rat, mouse or other rodent, cat, dog, 
pig, sheep, goat, cattle or horse, or which is a bird, such as a chicken. 

1. The Veneered Ligand-Binding Domain (LBD) 
25 According to an embodiment of the present invention, the chimeric transcription factor comprises 

the veneered ligand-binding domain (LBD) of a nuclear hormone receptor. The veneered ligand-binding 
domain does not bind the naturally occurring ligand of the nuclear hormone receptor, but binds an 
inactive analogue of the naturally occurring ligand. Binding of the inactive analogue to the veneered 
ligand-binding domain activates the transcription factor. The inactive analogue neither binds nor 
30 activates the nuclear hormone receptor. 

According to a preferred embodiment of the present application, the novel transcription factor 
comprises a veneered ligand-binding domain (LBD) of human estrogen receptor a (hERa). 

According to a further prefeixed embodiment of the present application, the novel transcription 
factor comprises a veneered ligand-binding domain having a sequence selected from the group consisting 
35 of SEQ ID NO: 16^1. 
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2. The Transcriptional Regulatory Domain 

The transcriptional regulatory domain of the chimeric transcription factor may be any available to 
those skilled in the art. The transcriptional regulatory domain can be an activation domain (AD). 
5 Polypeptides that activate transcription in eukaryotic cells are well known in the art In particular, 

transcriptional activation domains of many DNA binding proteins have been described and have been 
shown to retain their activation function when the domain is transferred to a heterologous protein. 

Transcriptional activation domains found within various proteins have been grouped into 
categories based upon similar structural features. Types of transcriptional activation domains include 

10 acidic domains, proline-rich domains, serine/threonine-rich domains and glutamine-rich domains. 

Examples of the acidic domains include the VP16 regions already described and amino acid residues 753- 
881 of GAL4. Examples of the proline-rich domains include amino acid residues 399-499 of CTF/NF1 
and amino acid residues 31-76 of AP2. Examples of the serine/threonine-rich domains include amino 
acid residues 1-427 of ITF1 and amino acid residues 2-45 1 of ITF2. Examples of the glutamine-rich 

15 domains include amino acid residues 175-269 of Octl and amino acid residues 132-243 of SpL The 
amino acid sequences of each of the regions described above, and of other useful transcriptional 
activation domains, are disclosed in Seipel, K. et al. (EMBO J., 1992 12:4961-4968). 

In a preferred embodiment of the present invention, the activation domain is an activation domain 
(AD) of human p65 protein (Schmitz, M. L. and Bauerle, P.A., 1991, EMBO J., 10:3805-3817), more 

20 preferably comprising the region spanning amino acids 285-551 of human p65, or a transcription- 
activating portion encompassed within this region. In another embodiment, multimers of the p65 AD 
may be used. In another embodiment, multimers of portions of the p65 AD may be used. 

In another preferred embodiment of the present invention, the activation domain comprises the 
herpes simplex virus virion protein 16 (VP16) (Triezenberg, S. J. et al. (1988) Genes Dev. 2:718-729). 

25 Preferably, about 127 of the C-terminal amino acids of VP16 are used; more preferably, about 11 of the 
C-terminal amino acids (amino acids 437-447) of VP16 are used Preferably, multimers (two to four 
monomers) of this region are used; more preferably, a dimer of this region (i.e., about 22 amino acids) is 
used. Suitable C-terminal peptide portions of VP 16 are described in Seipel, K. et al. (EMBO J., 1992 
13:4961-4968). For example, a dimer of a peptide having an amino acid sequence DALDDFDLDML can 

30 be used. 

In another embodiment of the present invention, the activation domain comprises or consists of 
the AD of the PPARy-1 coactivator (PGC-1) (Puigserver P. et al., 1998, Cell, 92, 829). In one 
embodiment, the region spanning aa 1-70 of the N-terminus of PGC-1 is used (Puigserver, P., Science, 
1999, 1368-1371). In another embodiment, the region spanning aa 1-65 of the N-terminus of PGC-1 is 
35 used. In another embodiment, multimers of the PGC-1 AD, or portions of it, may be used. 
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In another embodiment, transcription is activated by an indirect mechanism, through recruitment 
of a transcriptional activation protein to interact with a fusion protein comprising DBD and regulatory 
domain. This may, for example, be via a polypeptide domain (e.g., a dimerization domain) which 
mediates a protein-protein interaction with a transcriptional activator protein, such as an endogenous 
5 activator present in a host cell. 

Other polypeptides with transcriptional activation ability in eukaryotic cells can also be used in 
an activation domain in accordance with the present invention. 

Moreover, the transcriptional regulatory domain can also be a repression domain. In other 
embodiments, chimeric transcription factors capable of repressing transcription are generated 
10 (Transcriptional Repressors). In this case, the transcription factor comprises a repression domain, which 
directly or indirectly repress transcription in eukaryotic cells. 

Polypeptides that repress transcription in eukaryotic cells are well known in the art In particular, 
transcriptional repression domains of many DNA binding proteins have been described and have been 
shown to retain their activation function when the domain is transferred to a heterologous protein 
15 (Deuschle et al., 1995, Mol. Cell. Biol. 15, 1907-1914; Freundiieb S. et al., 1999, J. Gene Medicine, 1,1). 

An example of such domains, capable of repressing instead of activating transcription, is the 
KRAB repressor domain of the human Koxl zinc finger protein (Margolin J., 1994, Proc. Natl. Acad. Sci. 
USA, 91,4509-45 13). This domain can be used either as single domain or in multimeric forms. 

20 3. The DNA-Binding Domain (DBD) 

The DNA-binding domain of the chimeric transcription factor may be any available to those 
skilled in the art. Polypeptides that bind to DNA in eukaryotic and prokaryotic cells axe well known in 
the art. In particular, DNA-binding domains of many DNA binding proteins have been described and 
have been shown to retain their DNA-binding function when the domain is transferred to a heterologous 

25 protein. 

DNA-binding domains found within various proteins have been grouped into categories based 
upon similar structural features. Types of DNA-binding domains include those with helix-turn-helix 
motif, zinc finger motif (Frankel, A. D. et al. (1988) Science 240:70-73), leucine zipper motif 
(Landschulz et al. (1989) Science 243:1681-1688), or helix-loop-helix motif (Murre, C. et al. (1989) Cell 
30 58:537-544), and those from high mobility group. The helix-turn-helix motif is a component of 

homeobox domain, which has been identified in many invertebrate and vertebrate regulators of gene 
expression. Zinc-finger motifs have been identified in TJLLIFA, and steroid hormone receptors. The 
leucine zipper motif has been found in the proto-oncoprotein Myc, Fos and Jim. The helix-loop-helix 
motif has been found in myogenic transcription factors. In addition to DNA-binding, the domains 
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containing zinc finger motif, leucine zipper motif, or helix-loop-helix motif also play a role in the 
dimerization of transcription factors. 

According to a preferred embodiment of the present invention, the DNA-binding domain is from 
a tissue-specific transcription factor, which is expressed in tissues other than the tissues that the chimeric 
5 transcription factor is desired to be exprerssed. 

According to a preferred embodiment of the present invention, the DNA-binding domain is a 
DNA-binding domain of human HNF-1. Chimeric transcription factors containing the DBD specifically 
activate (or repress) transcription of sequences controlled by HNF-1 responsive promoters. Chimeric 
transcription factors containing the HNF-1 DBD are useful for regulating, in tissues that do not express 
10 endogenous HNF-1, the level of transcription of any target gene linked to the selected HNF-1 DNA 
binding sites. 

HNF-1 (also called LF-B1 or HNF-loc) is a transcription factor that has been implicated as a 
major determinant of hepatocyte-specific transcription of several genes (Frain M. 1990, Cell, 59, 145- 
157). The consensus binding site derived from these sequences is the palindrome 

15 GGTTAAT(N)ATTAATA (SEQ ID NO: 42) (Tranche F. et la., 1997, J. Mol Biol., 266:231-245). 

Consistent with the dyad symmetry of this site, HNF-1 binds DNA as a dimer. The DNA binding domain 
is located in the first N-terminal 28 1 aa of HNF-1 (DBD = 1-28 1). 

Natural HNF-1 polypeptides are expressed at high levels in hepatocytes. They are also expressed 
in tissues other than liver, such as kidney, intestine, stomach and pancreas. However, HNF-1 proteins are 

20 not naturally expressed in several cell lines and tissues, such as muscle. 

A transgene cloned downstream of an HNF-1 -dependent promoter is not transcribed when 
delivered in cells lacking endogenous HNF-1 (Toniatti C. et al., 1990, EMBO J., 9, 4467-4475). Since 
HNF-1 is not present in muscles, a transgene cloned downstream of an HNF-1 -dependent promoter may 
be silent when delivered into muscle cells in vivo and in vitro. However, previous results obtained in 

25 vitro provide indication that such a transgene could be activated if an expression vector encoding for 
HNF-1 is co- delivered into muscles (Toniatti C. et al., 1990, EMBO J., 9, 4467-4475). 

In a preferred embodiment of the present invention, the HNF-1 DNA binding domain comprises 
or consists of residues 1-282 of human HNF-1 (Bach, et al (1990), Genomics, 8(1): 155-164 (Sequence 
accession number P20823), or a DNA- binding portion encompassed within these residues. 

30 The present invention provides a transcription factor that comprises SEQ ID NO: 43. The present 

invention also provides a polynucleotide that encodes the transcription factor as shown in SEQ ID NO: 
43. 

According to an alternative embodiment of the present invention, the DBD is GAL4 minimal 

DBD. 
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Other polypeptides with DNA-binding ability in eukaryotic and prokaryotic cells can also be used 
in a DNA-binding domain in accordance with the present invention. 

4. Other Domains of the Chimeric Transcription Factor 
5 A chimeric transcription factor of the present invention (which may be a single fusion protein) 

may further comprise one or more additional polypeptide components, such as a nuclear localization 
signal (NLS), which promotes transport into a cell nucleus. 

Nuclear localization signals typically are composed of a stretch of basic amino acids. When 
attached to a heterologous protein (e.g., a fusion protein of the invention), the nuclear localization signal 
10 promotes transport of the protein to a cell nucleus. The nuclear localization signal is attached to a 

heterologous protein such that it is exposed on the protein surface and does not interfere with the function 
of the protein. Preferably, the NLS is attached to one end of the protein, e.g. the N-terminus. The amino 
acid sequence of a non- limiting example of an NLS that can be included in a fusion protein of the 
invention is Met-Pro-Lys-Arg-Pro-Arg-Pro (SEQ ID NO: 44). Preferably, a nucleic acid encoding the 
15 nuclear localization signal is spliced by standard recombinant DNA techniques in-frame to the nucleic 
acid encoding the fusion protein (e.g., at the 5 1 end). 

V. Orthogonal Gene Switches Using the Chimeric Transcription Factor 
and the Inactive Analogues 

20 The present invention further provides an orthogonal gene switch for regulating the expression of 

a desired gene. The gene switch comprises a novel chimeric transcription factor which, as discussed 
above, is activated or repressed by an inactive analogue of the naturally occurring ligand of the nuclear 
hormone receptor, but does not respond to the naturally occurring ligand itself. 

The present invention therefore provides an orthogonal gene switch that is capable of controlling 

25 the expression of a heterologous gene or a series of heterologous genes through the application of an 
effective exogenous ligand. One of the major advantages of the present invention is that there is no 
mutual interference between the gene switches and endogenous gene regulation systems. In other words, 
the exogenous inducer cannot activate the endogenous gene expression, while the endogenous hormones 
cannot turn on the gene switch. 

30 According to an embodiment of the present invention, the orthogonal gene switch also comprises 

a construct comprising a desired gene, and a regulatory region that is fused to the desired gene. The 
transcription factor is capable of binding to the regulatory region. 

According to a preferred embodiment of the present invention, the gene switch is modulated by 
an exogenous ligand, which is the inactive analogue. Thus, the exogenous ligand can be used to control 
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the expression of the desired gene, without interferring the functions of endogenous nuclear hormone 
receptor. 

1. The Gene Desired to Be Regulated 
5 The gene desired to be regulated is a nucleotide sequence of interest whose transcription is 

regulated by the gene switch. According to an embodiment of the present invention, the desired gene 
encodes a polypeptide or peptide, an antisense sequence, a dsRNA (double-stranded RNA), an siRNA 
(short interfering RNA), or a ribozyme. 

A polypeptide whose expression may be controlled using the present invention may be selected 

10 according to the desires and aims of the person performing the invention, and may be a therapeutic 
protein or a cytotoxic protein. The type of the therapeutic protein is determined by the disease to be 
treated. For instance, The therapeutic protein used in cancer gene therapy can be cytokines (Agha- 
Mohammadi, S. and Lotze, M.T., J. Clin. Invest. 105, 1173-1176 (2000)), prodrug activating enzymes 
(Springer, C. J. and Niculescu-Duvaz, L, /. Clin. Invest. 105, 1161-1167 (2000)), antibodies, and 

1 5 tumoricidal gene products. 

Polypeptide expression may be inhibited by using appropriate nucleic acid to influence 
expression by antisense regulation, and an antisense sequence may be placed under transcriptional control 
in accordance with the present invention. The use of anti-sense genes or partial gene sequences to down- 
regulate gene expression is now well-established. Double-stranded DNA is placed under the control of a 

20 promoter in a "reverse orientation" such that transcription of the "anti-sense" strand of the DNA yields 
RNA which is complementary to normal mRNA transcribed from the "sense" strand of the target gene. 
The complementary anti-sense RNA sequence is thought then to bind with mRNA to form a duplex, 
inhibiting translation of the endogenous mRNA from the target gene into protein. Whether or not this is 
the actual mode of action is still uncertain. However, it is established fact that the technique works. 

25 Alternatively, gene expression can also be inhibited by RNAi (RNA interference), and a sequence 

coding a dsRNA or an siRNA may be placed under transcriptional control in accordance with the present 
invention. Double-stranded RNAs are capable of inhibiting gene expression in a sequence-specific 
manner in diverse organisms, such as plant, C. elegans, and Drosophilia (Hommond S. M., Nature 
Review Genetics 2, 1 10-119 (2001)). Moreover, siRNA, which is short dsRNA, has been shown to 

30 prohibit gene expression in a sequence-specific manner by causing RNAi (Elbashir, S. M., et dL % Nature 
411, 494-498 (2001); Bass, B. L., Nature 411, 428^29 (2001)). 

SiRNA can be continually produced in transfected mammalian cells, by the expression of small 
hairpin RNAs (shRNAs), which are processed into siRNA by the RNA machinaery in vivo 
(Brummelkamp T. R., et aL 9 Science 296, 550-553 (2002); Paddison, P. J., et aL 9 Genes & Dev. 16, 948- 

35 958 (2002); Paul, C. P., et aL, Nature Biotechnol. 20, 505-508 (2002)). Alternatively, SiRNA can be 
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produced in mammalian cells by the expression of both sense RNA and antisense RNA, which then 
hybridize in vivo to form siRNA (Miyagishi, M., and Taira K., Nature Biotechnol 20, 497-500 (2002)). 
The small hairpin RNA, or the sense RNA and antisense RNA, can be placed under the control of a 
promoter responsive to a chimeric transcription factor according to the present invention. 
5 Another possibility is that nucleic acid is used which on transcription produces a ribozyme, able 

to cut nucleic acid at a specific site - thus also useful in influencing gene expression. Background 
references for ribozymes include Kashani-Sabet and Scanlon (1995). Cancer Gene Therapy, 2, (3) 213- 
223, and Mercola and Cohen (1995). Cancer Gene Therapy 2, (1) 47-59. 

10 2. The Regulatory Region 

The desired gene is operatively linked to the regulartory region containing at least one 
oligonucleotide sequence to which the chimeric transcriptional factor binds. The regulatory region is 
usually located upstream (i.e., 5') to the sequence to be transcribed and, where appropriate, minimal 
promoter. The regulatory region sequence may also be operatively linked downstream (i.e., 3 1 ) of the 

1 5 nucleotide sequence to be transcribed. 

According to an embodiment of the present invention, the regulatory region may comprise single 
or mutimeric binding sites of the chimeric transcription factor. According to a preferred embodiment of 
the present invention, the regulatory region is an artificial promoter that is controlled by the chimeric 
transcription factor. The artificial promoter comprises one or multiple binding sites of the chimeric 

20 transcription factor. 

According to an embodiment of the present invention, the chimeric transcription factor comprises 
the DNA-binding domain of HNF-1; the promoter responsive to the transcription factor may comprise at 
least one binding site of HNF-1 and one or more binding sites for one or more different transcription 
factors. 

25 

3. The Exogenous Ligand 

The present invention provides exogenous ligands that are capable of binding to and activating 
the chimeric transcription factor, but incapable of binding the corresponding wild-type nuclear hormone 
receptor. The exogenous ligands are preferably inactive analogues of antagonists or agonists of the wt 
30 nuclear hormone receptor. 

According to an embodiment of the present invention, the exogenous ligand is an inactive 
analogue of a human estrogen receptor P (hER0)-specific agonist or antagonist According to a preferred 
embodiment of the present invention, the inactive analogue is selected from the group consisting of: 
CMP1, CMP4, CMP5, CMP6, CMP7, CMP8, CMP9, CMP10, and the compounds listed in tables la, lb, 
35 2 and 3. The Structures and Synthses of the compounds have been discussed above. 
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According to a preferred embodiment of the present application, the orthogonal gene switch 
comprises a chimeric transcription factor having a LBD whose amino acid sequence is selected from the 
group consisting of SEQ ID NO: 16-41. 

For the gene switches comprising an amino acid sequence selected from the group consisting of 
5 SEQ ID NO: 16-28, the exogenous ligand is selected from the group consisting of: CMP1, CMP4, CMP5, 
and CMP 11-38. 

For the gene switches comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO: 29-41, the exogenous ligand is selected from the group consisting of: CMP6, CMP7, CMP8, 
CMP9, and CMP10. 

10 Expression of the sequence of interest in target cells is stimulated by administering the exogenous 

ligand to the target host cell. To stop expression of the gene of interest in cells of the subject, 
administration of the exogenous ligand is stopped. 

Where a repression domain is employed in the chimeric transcription factor, expression of the 
sequence of interest in target cells is repressed in the presence of the ligand and then stimulated by its 

15 withdrawal. To stop expression of the gene of interest in cells of the subject, the ligand is readministered. 

In both cases the level of gene expression can be modulated by adjusting the dose of the ligand 
administered to the patient. Thus, in a host cell, transcription of the desired gene may be controlled by 
altering the concentration of the exogenous ligand in contact with the host cell (e.g. adding the ligand to a 
culture medium, or administering the ligand to a host organism, etc.). 

20 

VI. The Applications of the Orthogonal Gene Switch System 
Heterologous proteins are expressed for various purposes in genetically engineered eukaryotic 
cells such as yeast cells and mammalian cells. A gene switch according to the present invention can be 
used to regulate the expression of the heterologous proteins in the eucaryotic cells. In addition, the switch 
25 provides a further advantage in eukaryotic cells where accumulation of large quantities of a heterologous 
protein can damage the cells, or where the heterologous protein is damaging such that expression for short 
periods of time is required in order to maintain the viability of the cells. 

An orthongonal gene switch according to the present invention can be widely applicable to a 
variety of situations where it is desirable to be able to regulate gene expression in host cells such as 
30 cultured eukaryotic cells. 

Such an inducible system also has applicability in gene therapy allowing the timing of expression 
of the therapeutic protein to be controlled. The present invention is therefore not only applicable to 
transformed mammalian cells but also to mammals per se. 
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Expression of the gene of interest in host cells is stimulated or repressed by administering the 
exogenous ligand to the patient, or to the cells directly. To stop expression of the gene of interest, 
administration of the exogenous ligand is stopped. 

5 1. Gene Therapy 

The invention is preferentially employed for gene therapy purposes in humans and/or for research 
purposes in non-human species. Gene therapy is the treatment of certain disorders, especially those 
caused by genetic anomalies or deficiencies, by introducing specific engineered genes into a patient's 
cells. Gene therapy has been deveolped to treat various diseases. The candidate diseases for gene therapy 
10 include cancer, cardiovascular disease, cystic fibrosis, AIDS, Gaucher' s disease, familial 
hypercholesterolemia, rheumatoid arthritis and sickle cell anemia, and muscular dystrophy. 

Regulatable gene expression is often crucial for gene therapy. The regulation of gene expression 
can be achieved with an orthorgonal gene switch according to the the present invention. Cells of a subject 
in need of gene therapy may be modified to contain (1) nucleic acid encoding the chimeric transcription 
1 5 factor in a form suitable for expression in the host cells, and (2) a sequence of interest (e.g. for therapeutic 
purposes) operatively linked to a promoter responsive to the chimeric transcription factor. 

According to an alternative embodiment of the present invention, the orthogonal gene switch is 
used in veterinary gene therapy. The invention therefore further pertains to gene therapy in mammalian 
species other than human. The non-human mammalian species can be selected from the group consisting 
20 of rabbit, guinea pig, rat, mouse, cat, dog, pig, sheep, goat, cattle and horse. Alternatively, the invention 
pertains to gene therapy in avian species, such as a chicken. 

In addition to regulatable gene expression, effective gene therapy requires efficient delivery of 
the gene switch to targeted mammalian cells. 

Gene therapy can also be used to treat diseases in combination with other therapies (W.M. 
25 Rideout m et al., Cell 109, 17-27 (2002)). 

To induce or repress transcription in vivo, the ligand may be administered to the body, or a tissue 
of interest (e.g. by injection). The ligand should be non-toxic to the body. The body to be treated may be 
that of an animal, particularly a mammal, which may be human or non-human. Suitable routes of 
administration include oral, intraperitoneal, intramuscular, i.v. The ligand can then be absorbed by the 
30 target cells. In all cases described, the concentration of the ligand will be proportional to the 
concentration of chimeric transcription factor expressed in the host cells. 

2. Other Uses of the Orthogonal Gene Switch 

Besides the use for gene therapy outlined in the previous sections, orthogonal gene switches 
35 according to the present invention can be used to: 
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1) conditionally express a suicide gene in cells, thereby allowing for elimination of the cells after 
they have served an intended function. For example, cells used for vaccination can be eliminated in a 
subject after an immune response has been generated by the subject by inducing expression of a suicide 
gene in the cells with the specific ligand. 
5 2) modulate expression of genes that are contained in recombinant viral vectors and might 

interfere with the growth of the viruses in the packaging cell lines during the production processes. These 
recombinant viruses might be derivatives of Adenoviruses, Retroviruses, Lentiviruses, Herpesviruses, 
Adenoassociated viruses, and other viruses that are familiar and obvious to those skilled in the art. 

3) provide large-scale production of a toxic protein of interest using cultured cells in vitro that do 
10 not contain endogenous active transcription factors that bind the binding site. The cultured cells have 

been modified to contain a nucleic acid encoding the chimeric transcription factor in a form suitable for 
expression of the transcription factor in the cells and a gene encoding the protein of interest operatively 
linked to a promoter responsive to the transcription factor. 

4) determine whether the functions of a gene. The expression of an endogenous gene can be 
15 changed by the orthogonal gene switch. For instance, a chimeric transcription factor can be used to 

control the production of an SiRNA to supress the expression of the target gene, and determine the 
functions of the gene according to the phenotype caused by the suppression. 

One convenient way of producing a polypeptide or fusion protein according to the present 
invention is to express nucleic acid encoding it, by use of nucleic acid in an expression system- 
20 Accordingly, the present invention also provides in various aspects nucleic acid encoding the 

transcriptional activator or repressor of the invention, which may be used for production of the encoded 
protein. 

3. Delivery of the Gene Switch to the Host Cells 

25 To be effectively delivered to the target cells, gene switches according to the present invention 

can be carried by appropriate vectors. Suitable vectors can be chosen or constructed, containing 
appropriate regulatory sequences, including promoter and/or enhancer sequences responsive to a chimeric 
transcription factor according to the present invention, terminator fragments, polyadenylation sequences, 
sequences, marker genes, and other sequences as appropriate. Moreover, tissue-specific regulatory 

30 elements can also be used in the vectors, to regulate expression of a polypeptide or fusion protein 
preferentially in a particular cell type. 

For further details of the DNA recombinant techniques used in vector construction, see, for 
example, Molecular Cloning: a Laboratory Manual: 3rd edition, Sambrook et al., 2001, Cold Spring 
Harbor Laboratory Press. Many known techniques and protocols for manipulation of nucleic acid, for 

35 example in preparation of nucleic acid constructs, mutagenesis, sequencing, introduction of DNA into 
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cells and gene expression, and analysis of proteins, are described in detail in Current Protocols in 
Molecular Biology, Ausubel et al. eds., John Wiley & Sons, 2003. 

In gene therapy, the nucleic acid according to the present invention can be introduced into the 
host cells via either in vivo approach or ex vivo approach. Under the in vivo approach, nucleic acid is 
5 directly delivered to the target cells in the subject to be treated. Under the ex vivo approach, target cells 
are first taken from the treated subject, transfected with the nucleic acid in vitro, and are then implanted 
or otherwise administered back to the treated host. 

For both in vivo and ex vivo gene delivery, appropriate vectors include plasmid, viral vector, and 
liposome. According to an embodiment of the present invention, the recombinant expression vector is a 
10 plasmid. 

In a preferred embodiment of the present invention, the recombinant, expression vector is a 
veneered virus, or portion thereof, which allows for expression of a nucleic acid introduced into the viral 
nucleic acid. For example, replication defective retroviruses, adenoviruses and adeno- associated viruses 
can be used. The genome of a virus such as adenovirus can be manipulated such that it encodes and 

15 expresses a chimeric transccription factor according to the present invention, but is inactivated in terms of 
its ability to replicate in a normal lytic viral life cycle. Protocols for producing recombinant retroviruses 
and for infecting cells in vitro or in vivo with such viruses can be found in Ausubel, et al. (supra). 

Vectors such as viral vectors have been used to introduce nucleic acid into a wide variety of 
different target cells. Typically the vectors are exposed to the target cells so that transfection can take 

20 place in a sufficient proportion of the cells to provide a useful therapeutic or prophylactic effect from the 
expression of the desired polypeptide. 

A variety of vectors, both viral vectors and plasmid vectors, are known in the art (see, e.g., US 
Patent No. 5,252,479 and WO 93/07282). In particular, a number of viruses have been used as gene 
transfer vectors, including papovaviruses, such as SV40, vaccinia virus, herpesviruses, including HSV 

25 and EB V, retroviruses, adenoviruses, and adeno-associated viruses. 

The nucleic acid may be on an extra-chromosomal vector within the cell, or otherwise 
identifiably heterologous or foreign to the cell. An example of extra-chromosomal vectors is adenoviral 
vector. Adenovirus can infect a broad range of human cells, including those of the lung, liver, blood 
vessels and brain, but cannot integrate into the genome of the host cell. The treatment using an extra- 

30 chromosomal vector may have to be repeated periodically. 

Alternatively, the transfected nucleic acid may be permanently incorporated into the genome of 
each of the targeted cells, providing long lasting effect. Adeno-associated virus, retrovirus, including 
oncoretrovirus and lentivirus, are also capable of infecting various human cells, and integrate into the 
genome of the host cell. Many gene therapy protocols in the prior art have used disabled murine 

35 retroviruses, and more recently, lentiviruses. Lentivirual vectors, like oncoretroviral vectors, can 
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integrate into the genome of nondividing cells, such as hematopoietic stem cells in their primitive state. 
Viral vectors derived from these viruses can therefore be constructed as integrating vector. 

Integration may also be promoted by inclusion of sequences that promote recombination with the 
genome, in accordance with standard techniques. Recombinant adenoviruses have been among the most 
5 widely explored vector systems for delivering genes to mammalian cells. However, it is difficult to 

achieve long-term gene expression using recombinant adenoviral vectors, because adenovirus is incapable 
of integrating into the genome of host cells. One solution to this problem is the recombinant adenovirus 
carrying "Sleeping Beauty" transposon machinary and Flp recombinase. The recombinant virus can 
integrate the desired transgene into the genome of the host cells (Yant, S.R., et cd 9 Nat BiotechnoL 20, 

10 999-1005 (2002)). 

As an alternative to the use of viral vectors, other known methods of introducing nucleic acid into 
cells in gene therapy includes transfer mediated by liposomes and direct DNA uptake and receptor- 
mediated DNA transfer. Receptor-mediated gene transfer, in which the nucleic acid is linked to a protein 
ligand via polylysine, with the ligand being specific for a receptor present on the surface of the target 

15 cells, is an example of a technique for specifically targeting nucleic acid to particular cells. 

The nucleic acids need to be delivered to the host cells can include, a first nucleic acid encoding a 
chimeric transcription factor as disclosed, and a second nucleic acid comprising a nucleotide sequence to 
be transcribed operatively linked to a transcription unit 

In one embodiment, the first and second nucleic acids are separate molecules, i.e., in two 

20 different vectors. In this case, a host cell may be cotransfected with the two vectors or successively 

transfected with the vectors. In another embodiment, the nucleic acids are linked (i.e., colinear) in the 
same molecule, i.e., in a single vector. In this case, a host cell may be transfected with the single nucleic 
acid molecule, which is preferred in gene therapy. 

In an alternative, a sequence to be transcribed may be endogenous to a host cell. An endogenous 

25 sequence may be operatively linked to an appropriate transcription unit by means of homologous 

recombination. For example, a homologous recombination vector can be prepared which includes a 
promoter sequence responsive to a chimeric transcription factor of the present invention, flanked at its 3' 
end by sequences representing the coding region of the endogenous gene and flanked at its 5' end by 
sequences from the upstream region of the endogenous gene by excluding the actual promoter region of 

30 the endogenous gene. The flanking sequences are of sufficient length for successful homologous 

recombination of the vector DNA with the endogenous gene. Preferably, several kilobases of flanking 
DNA are included in the homologous recombination vector. Upon homologous recombination between 
the vector DNA and the endogenous gene in a host cell, the endogenous promoter is replaced by the 
recombinant promoter. Thus, expression of the endogenous gene is no longer under the control of its 
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endogenous promoter but rather is placed under the control of the transcription unit in accordance with 
the present invention. 

In another embodiment, an operator sequence may be inserted elsewhere within an endogenous 
gene, preferably within a 5* or 3' regulatory region, via homologous recombination to create an 
endogenous gene whose expression can be regulated by a transcriptional activator or repressor described 
herein. For example, one or more binding sequences of a chimeric transcription factor of the present 
invention can be inserted into a promoter or enhancer region of an endogenous gene such that promoter or 
enhancer function is maintained. 

When muscle is the target tissue for gene therapy, direct intramuscular injection of either viral- or 
non- viral vectors is one of the preferred modes for transgene delivery in vivo. In particular, direct 
intramuscular injection of viral or non-viral vectors encoding: i) antigens from viruses, bacteria or 
protozoans result in the protection against a subsequent challenge with the corresponding pathogen; ii) 
tumor-specific antigens result in protection of mice against challenges with tumorigenic cells expressing 
the corresponding antigen; iii) secreted proteins result in delivery into the bloodstream (Marshall, D. J. 
and Leiden, J. M., 1998, Curr. Opin. Genet. Dev., 8, 360-365). 

A still further aspect provides a method of introducing the nucleic acid into a host cell in vitro. 
The in vitro introduction can be used for ex vivo gene therapy and the non-gene therapy methods 
discussed above. The introduction, which may (particularly for in vitro introduction) be generally 
referred to without limitation as "transformation", may employ any available technique. For eukaryotic 
cells, suitable techniques may include calcium phosphate transfection, DEAE-Dextran, electroporation, 
liposome-mediated transfection and transduction using viruses as disclosed above. In addition, viruses 
such as vaccinia, and baculovirus (for insect cells), can also be used. For bacterial cells, suitable 
techniques may include calcium chloride transformation, electroporation and transfection using 
bacteriophage. As an alternative, direct injection of the nucleic acid could be employed. Marker genes 
such as antibiotic resistance or sensitivity genes may be used in identifying clones containing nucleic acid 
of interest, as is well known in the art 

4. Suitable Composition 

Suitable compositions may be needed for the delivery of the polynucleotides encoding the gene 
switch to the target cells or tissues of the mammals to be treated, as well as the administration of the 
exogenous ligand to the mammals. 

A composition according to the present invention that is to be given to an individual, 
administration is preferably in a "prophylactically effective amount" or a "therapeutically effective 
amount" as the case may be, although prophylaxis may be considered therapy), this being sufficient to 
show benefit to the individual. The actual amount administered, and rate and time-course of 
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administration, will depend on the nature and severity of what is being treated. Prescription of treatment, 
e.g. decisions on dosage etc, is within the responsibility of general practitioners and other medical 
doctors. A composition may be administered alone or in combination with other treatments, either 
simultaneously or sequentially dependent upon the condition to be treated. 
5 Pharmaceutical compositions according to the present invention, and for use in accordance with 

the present invention, may include, in addition to active ingredient, a pharmaceutically acceptable 
excipient, carrier, buffer, stabiliser or other materials well known to those skilled in the art. Such 
materials should be non-toxic and should not interfere with the efficacy of the active ingredient. The 
precise nature of the carrier or other material will depend on the route of administration, which may be 

10 oral, or by injection, e.g. cutaneous, subcutaneous or intravenous. 

Pharmaceutical compositions for oral administration may be in tablet, capsule, powder or liquid 
form. A tablet may include a solid carrier such as gelatin or an adjuvant. Liquid pharmaceutical 
compositions generally include a liquid carrier such as water, petroleum, animal or vegetable oils, mineral 
oil or synthetic oil. Physiological saline solution, dextrose or other saccharide solution or glycols such as 

15 ethylene glycol, propylene glycol or polyethylene glycol may be included. 

For intravenous, cutaneous or subcutaneous injection, or injection at the site of affliction, the 
active ingredient will be in the form of a parenterally acceptable aqueous solution which is pyrogen-firee 
and has suitable pH, isotonicity and stability. Those of relevant skill in the art are well able to prepare 
suitable solutions using, for example, isotonic vehicles such as Sodium Chloride Injection, Ringer's 

20 Injection, Lactated Ringer's Injection. Preservatives, stabilisers, buffers, antioxidants and/or other 
additives may be included, as required. Liposomes, particularly cationic liposomes, may be used in 
carrier formulations. 

Examples of techniques and protocols mentioned above can be found in Remington's 
Pharmaceutical Sciences, 16th edition, Osol, A. (ed), 1980. . 
25 The composition may be administered in a localized manner to a tumor site or other desired site 

or may be delivered in a manner in which it targets tumor or other cells. 

Targeting therapies may be used to deliver the composition more specifically to certain types of 
cell, by the use of targeting systems such as antibody or cell specific ligands. Targeting may be desirable 
for a variety of reasons, for example if the agent is unacceptably toxic, or if it would otherwise require too 
30 high a dosage, or if it would not otherwise be able to enter the target cells. 

A composition may be administered alone or in combination with other treatments, either 
simultaneously or sequentially dependent upon the condition to be treated, such as cancer, virus infection 
or any other condition in which an effect mediated by activity of the fusion protein is desirable. 
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5. Host Cells 

Thus, a further aspect of the present invention provides a host cell containing heterologous 
nucleic acid as disclosed herein. 

The host cell can be, for example, a mammalian cell (e.g., a human cell), a yeast cell, a fungal 
5 cell or an insect cell. Moreover, the host cell can be a fertilized non-human oocyte, in which case the host 
cell can be used to create a transgenic organism having cells that express the transcriptional inhibitor 
fusion protein. 

According to an embodiment of the present invention, the mammalian host cell is a cell in a 
mammal, human or non-human. The nucleic acid is introduced to the cell via approaches of in vivo or ex 

10 vivo gene delivery. 

Hence, the invention is applicable to normal mammalian cells, such as cells to be modified for 
gene therapy purposes or embryonic cells modified to create a transgenic or homologous recombinant 
animal. Examples of cell types of particular interest for gene therapy purposes include hematopoietic 
stem cells, myoblasts, hepatocytes, lymphocytes, muscle cells, neuronal cells and skin epithelium and 

15 airway epithelium. Additionally, for transgenic or homologous recombinant animals, embryonic stem 

cells and fertilized oocytes can be modified to contain nucleic acid encoding a transactivator or repressor 
fusion protein. 

In addition to normal cells, the invention is applicable to cell lines. According to an alternative 
embodiment of the present invention, the mammalian host cell is a cultured cell, or a cell from a cell line. 
20 Examples of mammalian cell lines which may be used include CHO 30 dhfr- cells (Urlaub and Chasin 
(1980) Proc. Natl. Acad ScL USA 77:4216-4220), 293 cells (Graham et al. (1977) J. Gen. Virol. 36: pp 
59) and myeloma cells like SP2 or NSO (Galfre and Milstein (1981) Meth. Enzymol. 73(B):3-46). 

Preferably, the target cells or tissues do not contain endogenous active transcription factors that 
bind the binding site. For instance, the fact that HNF-1 is not expressed in muscles is of relevance for 
25 gene therapy purposes in accordance with the present invention. According to a preferred embodiment, 
the target cells are muscle cells, and the DNA binding domain of the chimeric transcription factor is from 
HNF-1. Thus, the gene of interest would not be stimulated by endogenous transcription factor. 

Nucleic acid encoding a chimeric transcription factor according to the present invention can also 
be transferred into a fertilized oocyte of a non-human animal to create a transgenic animal which 
30 expresses the transcription factor in one or more cell types. 

Aspects of the invention further provide non-human transgenic organisms, including animals, that 
contain cells which express chimeric transcriptional factor of the invention (i.e., a nucleic acid encoding 
the transactivator or repressor is incorporated into one or more chromosomes in cells of the transgenic 
organism). 
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The method for producing such transgenic cells is not particularly germane to the present 
invention and any method suitable for the target cell may be used; such methods are known in the art, 
including cell specific transformation. 



5 EXAMPLES 

Examples are provided below to further illustrate different features of the present 
invention. The examples also illustrate useful methodology for practicing the invention. These examples 
do not limit the claimed invention. 
Materials and Methods 

10 

Plasmid constructs 

The yeast expression vector pGBT9~GAL4DBD/ERa LBD/VP16AD was obtained by inserting a 

DNA cassette coding for the chimeric transcription factor into pET23b vector (Novagen) and then 

transferring the cassette into the pGBT9 yeast expression vector (Clontech). 
15 A HindHI-HincTl DNA fragment containing the coding sequence for the minimal GAL4 DNA- 

binding domain (DBD) (aa 1-93) was excised from the plasmid pAS2~l (Clontech) and inserted into 

pET23b vector digested with the same enzymes, thus generating pET-GAL4DBD. 

A Sacl-BamHl DNA fragment containing the coding sequence for the minimal VP 16 activation 

domain (AD) (aa 424-490) was excised from the plasmid pUHD172-lneo and inserted into pET- 
20 GAL4DBD digested with the same enzymes, thus generating pET-GAL4 DBD/VP16AD. In this plasmid 

the coding sequence for VP 16 AD was in frame downstream of the GAL4 DBD coding sequence and a 

two-ammo acid junction (TE) was introduced during cloning. 

The hERoc LBD coding sequence (aa 282-595) was obtained by PGR amplification using as 

template the plasmid phERoc/BSKS(-) containing the full-length hERoc ORF (1-595) and the following 
25 DNA primers: forward, 5 ' -GG AATTCGTTGACCGGGTCTGCTGG AGAC ATG-3 ' (SEQ ID NO: 45); 

reverse, 5 ' -GGAATTCGAGCTCTGAACC AGACCCGACTGTGGC A GGG AAACC-3 ' (SEQ ID NO: 

46). The obtained DNA fragment was digested with HincJL and Sacl and inserted into pET- 

GAL4DBD/VP16AD digested with the same enzymes, thus generating pET- 

GAL4DBD/hERa LBD/VP16AD. In this construct, the hERa LBD coding sequence was cloned in 
30 frame with the C-terminus of GAL4 DBD coding sequence through a two-amino acid linker (TG) and 
with the N-terminus of VP 16 AD to which it was joined by a GSGSE linker. 

The construct pGBT9-GAL4DBD/ERa LBD/VP16AD was obtained by excising the DNA 
cassette from pET-GAL4DBD/hERa LBD/VP16AD using Xhol and BamHl and cloning into the pGBT9 
vector digested with the same enzymes. 
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pGBT9-GAL4DBD/ERa LBD G(521)R/VP16AD was obtained by substituting the wt Ncol- 
ftspMI-digested DNA fragment contained in pET-GAL4DBD/hERa LBD/VP16AD with the same 
fragment carrying the mutated codon obtained from pGEX-ERa LBD G(521)R (see below) digested with 
the same enzymes. GAL4DBD/ERcc LBD G(521)R/VP16AD DNA-cassette was then transferred to the 
5 pGBT9 vector by Xhol and BamHl digestion, as described above. 

pGBT9- GAL4DBD/ERa LBD L(384)M7VP16AD was obtained by PCR-mediated site-directed 
mutagenesis using the wt construct as template and the following DNA primers: forward (Eagl,s), 5*- 
GTCCCTGACGGCCGACCAGATGG TCAGTGCCTTGTTGGATGCTGAGCCC-3' (SEQIDNO: 47); 
reverse [L(384)M Nco,as], 5 ' -GTGCTCCATGGAGCGCC AG ACGAGACC AATC ATC AGGATCTC 
10 CATCCAGGC-3' (SEQ ID NO: 48). The amplified mutated DNA fragment was digested with Eagl and 
NcdL and inserted into pGBT9-GAL4DBD/ERa LBD/VP16AD digested with the same enzymes. 

To introduce the G(521)R mutation in both pGBT9- GAL4DBD/ERa LBD L(384)M /VP16AD 
and pGBT9 GAL4DBD/ERoc LBD L(384)M, M(421)G/VP16AD a Stul-Sacl fragment containing the 
mutation was excised from pGBT9-GAL4DBD/ERa LBD G(521)R/VP16AD and inserted into the 
15 corresponding recipient vectors digested with the same enzymes. 

To clone the alternative mutations at the G-521 and H-524 positions of the selected mutant 
pGBT9- GAL4DBD/ERoc LBD L(384)M, M(421)G/VP16AD PCR-mediated site-directed mutagenesis 
was performed using the wt construct as a template and the Stul,s oligonucleotide as forward primer (5*- 
CAA.GGCAGGCCTGACCCTGCAGCAGCAGCACC-3 , ) (SEQ ID NO: 49) in combination with each of 
20 the following mutagenic reverse primers: GVJ BspMIas. 5'- 

GCATCTCCAGCAGCAGGTCATAGAGGGGCACCACGTTCTTG 

C ACTTCATGCTGTACAGATGCTCC ATCACTTTG-3 * (SEQ ID NO: 50); GL3spMI,as and GM, 
BspMI,as had the same sequence with the exception of the mutagenized triplet that was CAG and CAT, 
respectively; HVJBspMI,as, 5'- 
25 GCATCTCCAGCAGCAGGTCATAGAGGGGC^ 

GCTGTAC AGCACCTCCATGCCTTT-3 ' (SEQ ID NO: 51). Each of the mutagenized fragments was 
digested with Stul and BspMl and cloned into pET-GAL4 DBD/hERa LBDAHP16AD cut with the same 
enzymes. 

Each amino acid substitution was cloned into pGBT9-GAL4DBD/ER<x LBD L(384)M, 
30 M(42 1)G/ VP16AD by digesting each mutated pET- construct with Stul andBsmL and transferring the 
corresponding mutated fragments in the recipient vector digested with the same enzymes. 

To introduce the D(351)A mutation into pGBT9-GAL4DBD/ERa LBD L(384)M, M(421)G, 
H(524)VA^P16AD the suitable mutated DNA fragment was obtained by PCR-mediated mutagenesis 
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using as template pGBT9-GAL4DBD/ERoc LBD L(384)M/VP16AD plasmid and the forward primer 
(DA,Hind,s), 5 ' -CTTCAGTGAAGCTTCGATG ATGGGCTTACTGACCAACCTGGCAGCCAGGG -3' 
(SEQ ID NO: 52) coupled with the reverse primer L(384)M Nco.as (see above). The resulting fragment 
was digested with HindSR and Ncol and inserted into pGEX-ERa LBD L(384)M (see below) cut with the 
same enzymes. This latter construct was then digested with Eagl and Ncol and the restriction fragment 
was cloned into pGBT9- GAL4DBD/ER.0C LBD L(384)M, M(421)G, H(524)V/VP16AD digested with 
the same enzymes. 

Both plasmids pGEM-hERoc LBD and pGEX-ERa LBD contain the DNA sequence coding for 
hERcc LBD (aa 303-595) that in pGEX is in frame with the GST coding sequence. The construct 
pGEM(RI)-hERa LBD was obtained by inserting an EcoRl site in the polylinker of pGEM-hERa LBD 
between Notl and Sail sites downstream of the hERoc LBD coding sequence. The PCR reaction was 
performed using pGEM-hERa LBD as template and the following primers: forward, 5'- 
CTATGACCTGCTGCTGGAGATGCTGGACG-3' (SEQ ID NO: 53); reverse, S'-CATATGGTCGAC 
GAATTC GCGGCCGCAC-3 ' (SEQ ID NO: 54). The DNA fragment was digested with BspMl and Sail 
and inserted in pGEM-hERa LBD opened with the same enzymes. 

The construct pGEM(RI)-hERcx LBD G(521)R was obtained by PCR-mediated site-directed 
mutagenesis using pGEM(RT)-hERa LBD as template, Stul,s as forward primer (see above) and 
GR,BspMI,as (5'-GCATCTCCAGCAGCAGGT 

CATAGAGGX3GCACCACGTTCTTGCACTTCATGCTGTACAGATGCTCCATG^TTTG-3') (SEQ 
ID NO: 55) as reverse primer. The mutated DNA fragment was digested with Stul and BspMl and 
inserted into pGEM(RQ-hERa LBD digested with the same enzymes. 

The construct pGEX-ERcc LBD G(521)R was obtained by transferring the Ncol-EcoRl DNA 
fragment from pGEM-hERcc LBD G(521)R to pGEX-ERa LBD digested with the same enzymes. 

The construct pGEX-ERa LBD L(384)M was obtained by digesting the plasmid pGBT9- 
GAL4DBD/ERcc LBD L(384)M/VP16AD with Eagl and Ncol and inserting the restriction fragment 
containing the mutated codon into pGEX-ERa LBD digested with the same enzymes. 

The construct pGEX-ERa LBD L(384)M, M(421)I was obtained by PCR-mediated site specific 
mutagenesis using pGEX-ERa LBD L(384)M as template, the forward primer EagLs (see above) and the 
following reverse primer: 5'-GTCC AAGATCTCCACGATGCCCTCTACAC-3' (SEQ ID NO: 56). 
The DNA fragment was digested with Eagl and BgUl and inserted in pGEX-ERa LBD L(384)M cut with 
the same enzymes. 

Selected combinations of mutations were transferred from the corresponding pGBT9- 
GAL4DBD/ERa LBD L(384)M /VP16AD library vectors to the pGEX-ERa LBD plasmid by excision of 
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a DNA fragment coding for both the L(384)M substitution and the desired selected mutations using 
HindUL and Still and replacing the corresponding wt fragment in pGEX-ERa LBD digested with the same 
enzymes. 

Mammalian expression vectors coding for wt and mutated versions of GAL4DBD/ERa LBD 
5 L(384)M/VP16AD were obtained by digesting with Xhol and BainHl the corresponding pGBT9- 

constructs and cloning the restriction fragment in pM vector (Clontech) digested with the same enzymes. 

The 5GAL4UAS-pSEAP-reporter gene was constructed by digesting the plasmid pG5CAT 
(Clontech) with BamHl and EcoKL followed by filling-in with the Klenow enzyme. The restriction 
fragment containing five GAL4 UAS repeats and the Elb minimal promoter was then cloned upstream of 
10 the SEAP coding region into the pSEAP2-Basic plasmid (Clontech) digested with HindSR followed by 
Klenow filling-in. 

Plasmid pVlj/HEAm45.2 was obtained after several cloning steps. We first generated plasmids 
pHEAwt and pBS/ERwtLBD. To obtain pHEAwt, a fragment spanning aa 303-595 of the LBD of hERa 
was obtained by PCR amplification with primers S' -GATATC 
15 C AAGAAC AGCCTGGCCTTGTCCCTGACG-3 ' (SEQ ID NO: 57) and 5*- 

ACTAGTGAATTCG ACTGTGGCAGGGAAACCCTCTGCCTCCn -3* (SEQ ID NO: 58), using 
plasmid phERoc/BSKS(-) as a template. Digestion of the amplified fragment with enzymes Xbal and 
EcoRI released a fragment spanning aa 379-595 of the wt LBD. This was used as a substitute for the 
corresponding region of plasmid pHEA-1, which has been previously described (Roscilli et al., 2002 Mol 
20 Ther. 5:653-663), thus obtaining plasmid pHEAwt. 

From plasmid pHEAwt, the 303-595 region of the hERa wt LBD was excised as an EcoKV- 
EcoRI fragment and introduced into the plasmid pBlueScript//f i/wfUI-, in which the HindBI site had been 
removed by digestion with the enzyme HindSJI y filling-in by Klenow and re-ligation. The plasmid 
obtained was called pBS/ERwtLBD. 
25 Plasmid pBS/ERM45 was then constructed by inserting a HindHL-SiuI fragment from plasmid 

pGBT9-GAL4DBD/ER<xLBD L(384)M, M(421)G/VP16AD into plasmid pBS/ERwtLBD digested with 
the same enzymes. Plasmid pBS/ERM45 therefore contains hERa LBD (from aa 303 to aa 595) carrying 
the L(384)M, M(421)G mutations. This mutated LBD was then excised from pBS/Erm45 as an EcoKV- 
EcoRl fragment and used as a substitute for the wt LBD of plasmid pHEAwt, thus obtaining plasmid 
30 pVlj/HEAm45. 

Finally, a fragment spanning aa 1-406 of HEAm45 was obtained by digesting plasmid 
pVlj/HEAm45 with BgUI and introduced into plasmid pHEAl (Roscilli et al., 2002 Mol Ther. 5:653- 
663) digested with the same enzyme. The plasmid obtained was called pVlj/HEAm45.2. 

All DNA constructs were verified by automatic sequencing using suitable oligonucleotides. 
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Expression in bacteria and purification ofGST-ER LBD fusion proteins 

E. coli BL21 cells (CODONPLUS™ DE 3-RIL, Stratagene) were transformed with suitable 
pGEX-ERoc LBD expression plasmids. A 2-liter liquid culture derived from a single transformed 

5 bacterial colony was grown at 37° C to A^oo = 0.8 in M9 modified minimal medium (5 g/L glucose, 1 g/L 
ammonium sulphate, 100 mM potassium phosphate pH 7, 5 fiM biotin, 7 fiM thiamine, 0.5 % casamino 
acids, 0.5 mM MgS04, 0.5 mM CaCl2, 13 /*M FeS04, 50 mg/L ampicillin). It was then cooled to 18° C 

and induced with 600 fiM EPTG (isopropyl-b-thiogalactopyranoside) for 22 hours at 18° C. All 
subsequent operations were performed at 4°C unless otherwise indicated. 
10 Cells were harvested and disrupted with a Microfluidizer (Model 1 10-S) in 200 ml of a buffer 

containing 50 mM Tris-HCl pH 7.5, 0.5 mM EDTA, 10 % glycerol, 0.2 M NaCl, 0.4 % n-Dodecyl J5-D- 
maltoside (Calbiochem), 5 mM DTT, 1 mM PMSF, and COMPLETE (Boehringer) protease inhibitor 
mixture. Insoluble material was pelleted at 27,000 x g for 30 min in a Sorvall SS34 rotor. The clarified 
supernatant containing between 30 % and 50 % of the recombinant protein was either stored in aliquots at 

15 -80° C after shock freezing in liquid nitrogen to be used directly in in vitro binding assays or further 
purified. 

The supernatant was loaded on two connected 5 mi-High Trap GST-Sepharose columns 
(Pharmacia) pre-equilibrated in lysis buffer containing 50 mM Tris-HCl pH 8 and 0.1 % n-Dodecyl B-D- 
maltoside. The GST-fusion protein was eluted in the same buffer supplemented with 10 mM reduced 
20 glutathione and further purified on a Superdex 200 26/60 gel filtration column (Pharmacia) equilibrated 
with lysis buffer containing 0.1 % n-Dodecyl 6-D-maltoside. The peak fraction corresponding to the 
purified GST-ER LBD homodimeric form at a concentration of approximately 2 ptM was stored in 
suitable aliquots at -80°C after shock-freezing in liquid nitrogen. 

25 In vitro ligand-binding assays 

Determination of ligand affinity for the GST-ER LBD polypeptides was done by a competitive 
radiometric binding assay using tritium-labeled estradiol ( 3 [H]E2) (Amersham, 158 Ci/mmol, 558 
mCi/mg) as tracer. 

Microplates (Basic Hashplates, NEN) wells were coated for 12 hrs at 4° C with 100 of PBS 
30 containing anti-GST antibodies (Amersham) at a concentration of 5 Jig/ml. After washing three times 
with 200 (xl of PBS, the background was reduced by saturating with 200 jil of PBS containing 1 % BSA 
for 3 hrs at 4° C. Wells were then washed three times with 200 pi of lysis buffer containing 0. 1 M NaCl 
and 0.2 % n-Dodecyl B-D-maltoside (assay buffer). 
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Suitable amounts of crude E. coli supernatants containing the GST-ER LBD proteins (2-10 or 
of purified polypeptides (8 nM) were bound to the anti-GST antibody-coated wells for 1 hr at 23° C in 
200 ill of the assay buffer with constant agitation. After three more washes with 200 jliI of the same 
buffer, the ligand-binding reaction was set up in 195 \il of assay buffer containing 2 nM-10 nM 3 [H]E2 
5 and 5 ^1 of DMSO or of suitable dilutions of the test compounds in DMSO. Incubation was for 2 hrs at 
23° C with constant shaking, followed by SPA radioactivity measurement using a microplate scintillation 
and luminescence counter (Top Count NXT, Packard). IC50 values were obtained by multiparameter 
logistic fitting of the experimental data with the aid of a Kaleidagraph software. 

10 Yeast strains and growth conditions 

Yeast strain CG-1945 (Clontech) was used as a reporter host strain for the display and screening 
of the library . It contains both the HIS3 and lacZ reporter genes under the control of a GAL4-responsive 
UAS integrated into the genome. Yeast strain Y187 (Clontech) was instead used for quantitative 
transcriptional 0-galactosidase assays. It contains only the lacZ reporter gene (probably present in two 
15 copies), which is expressed at higher levels than in CG-1945 for being under the control of the natural 

intact GAL1 promoter instead of the synthetic UAS G n-merW) consensus sequence. Both strains are gal4" 
and gal80" and were propagated at 30°C in YPD medium- 
Yeast strains transformed with pGBT9 plasmids were grown and stored in SD minimal medium 
(Clontech) supplemented with -Trp amino acid mixture. Growth selection for the nutritional reporter 
20 IHS3 was performed on agar plates in SD minimal medium supplemented with —His/— Trp amino acid 

mixture and either 15-30 mM (single plasmid transformants) or 70 mM (library clones pool) of 3-amino- 
1,2,4-triazole (3-AT), a competitive inhibitor of the HIS3 protein. 

In vivo (3-galactosidase assays were performed on agar plates in SD minimal medium 
supplemented with -Trp amino acid mixture, X-gal (80 mg/L) and lx BU salts (lOx = 70 g of 
25 Na2P0 4 7H 2 O and 30 g of NaHbPO^ pH 7). Growth selective and X-gal plates were supplemented with 
estradiol, tamoxifen (Sigma), tetrahydrofluorenone compounds or DMSO at the indicated concentrations. 

Transformation of yeast cells 

Yeast cells were transformed using the Li Ac procedure and the YEASTMAKER Yeast 
30 transformation kit (Clontech) following the manufacturer protocol. 

For single plasmid small-scale transformations 0.1 jig of DNA were used to transform 0. 1 ml of 
yeast competent cells resulting in transformation efficiencies of approximately 10 5 colony-forming units 
(cfu) per jig of DNA. 
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To set up transformation conditions for homologous recombination of the pGBT9- 
GAL4DBD/BRa LBD L(384)M /VP 16 AD with mutagenized PCR products, the expression vector was 
linearised by restriction digestion with BgUl and half of it was treated with Klenow polymerase prior of 
gel purification. 100 ng of each of the two versions of the digested plasmid were then transformed as 
5 outlined with 200 ng, 300 ng or 400 ng of a 160 bp PCR fragment obtained using wild-type 

oligonucleotides corresponding to the degenerated oligonucleotides designed to construct the mutated 
library. 

Co-transformation of 100 ng of linearised blunt-ended vector with 300 ng of PCR fragment 
showed an efficiency of approximately 10 5 per \xg of DNA that was 100-fold higher than background 
10 with the recipient vector alone. This condition was used for the library-scale transformation procedure in 
which 60 \ig of the mutagenized fragment collection and 20 ^ig of linearised recipient vector were used to 
transform 1 ml of yeast cells with an estimated efficiency of 5 x 10 4 cfu per fig of DNA. 

Library construction and selection 

15 Degenerate oligonucleotide mixtures were synthesised by a split-pool strategy. At each of the 

positions corresponding to Leu391, Phe404, Met421, Ile424 or Leu428, the previously synthesized 
column material was split into 10 individual pools, the 10 possible codons were synthesized separately 
and the pools mixed again together. Codons used were: GGC(Gly), GCC(Ala), TGC(Cys), GTG(Val), 
ATC(Ile), CTG(Leu), ATG(Met), TTC(Phe), TAC(Tyr), TGG(Trp). 

20 A slightly higher quantity of pooled column material (15% of total) was used to synthesize the wt 

codon thus leaving 9.4 % for each of the other 9 remaining codons. This "wt-spiking" increased the 
relative frequency of clones with low number of substitutions (1 or 2) with respect to clones having 4 or 5 
substitutions that represent the majority of the library clones. Thus, the likelihood that single- or double- 
mutation clones would be lost during the synthesis, construction or selection of the library should be 

25 significantly decreased. While in a homogeneous " 10 %-library M the 45 possible combinations of single- 
substitution clones would have represented only 0.045 % of the total library, the use of 15 % wt-codon 
increases this fraction to 0.2152 %, i.e. the clones were about 5-fold as frequent represented in the library. 
Consequently, the fraction of clones with 5 substitutions (59049 possible combinations) dropped from 
about 59 % to about 44 %. 

30 Preparative amounts of mutated LBD fragments were synthesised by PCR amplification of 6 \ig 

of pGBT9-GAJL4DBD/hERa LBD/VP16AD DNA template by including 2. 1 nmol of each of the 
degenerated oligonucleotide mixes in a total reaction volume of 6 ml containing 250 mM dNTPs, 5 % 
DMSO, 600 jxl of Pfii 10 x buffer and 300 units of Pfu polymerase (Stratagene). The PCR amplification 
consisted of 25 cycles at 95° C for 1 min, 65° C for 1 min and 72° C for 2 min. 
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60 pg of the mutagenized 160 bp product mixture were purified on QIAquick spin columns 
(Qiagen) and used in a scaled-up co-transformation experiment as outlined together with 20 pg of 
linearised recipient vector. 30 ml of the co-transformation reaction were spread on twenty 23-cm x 23-cm 
-Trp selective plates, colonies were harvested after 3 days of growth at 30° C and 1 ml-glycerol stocks of 
5 the amplified library were made. Suitable dilutions of the co-transformation mixture were spread on 100- 
mm plates to control the efficiency of homologous recombination and to determine the library titer (5 x 
10 4 cfu per fig of DNA). Glycerol stocks were also titered and resulted to contain 1.2 x 10 5 cfu per pi. 1 x 
10 s cfu were spread on each of twenty 100-mm -His/-Trp selective plates containing 70 mM 3-AT and 
either 1 pM (first experiment) or 10 pM (second experiment) of CMP1 (a total of 2 x 10 6 cfu per 

10 experiment were plated). 

After 6 days of growth at 30° C the average number of His + colonies per plate was 200 in the first 
experiment and 60 in the second experiment. 80 colonies out of 200 of the first screening and all colonies 
of the second screening were streaked out from each plate on X-gal 100 mm plates containing either 
DMSO or CMP1 at the concentration of 1 pM or 10 pM, respectively (a replica on -Trp master plates 

15 was also performed). After 3 days of growth at 30°C all clones that resulted blue in the presence of the 
compound and white on control plates were streaked out of corresponding master plates on X-gal plates 
containing either DMSO or CMP1 at a 10-fold lower concentration and on -Trp plates. This procedure 
was reiterated down to a compound concentration of 0.1 pM at which 28 and 31 independent positive 
clones, respectively, were collected for further analysis. 

20 

Plasmid rescue from yeast 

Colonies corresponding to the mutants of interest were grown to saturation at 30° C for 16 hrs in 
2 ml of -Trp SD minimal medium. Cells from a 1.3 ml culture fraction were collected by centrifugation 
and resuspended by vortexing in 0.2 ml of protoplasting buffer (100 mM Tris-HCl pH 7.5, 10 mM EDTA, 

25 14.4 mM 0~mercaptoethanol) containing 400 pi of a 40 units/pl Lyticase (Sigma) solution. After cell 
walls were dissolved by incubation for 2 hr at 37° C, 200 pi of lysis solution (0.2 M NaOH, 1 % SDS) 
were added and samples were incubated at 65° C for 20 min, then put rapidly on ice. 

Samples were mixed with 200 pi of 3 M K-acetate pH 5.4, incubated on ice for 15 min, and spun 
for 3 min at 13,000 rpm in an Eppendorf microcentrifuge. Plasmid DNA was recovered from 

30 supernatants by precipitation with 0.6 volumes of isopropanol. Rescued plasmids were both transformed 
in electro-competent E. coli DH12S cells and directly sequenced by PCR-amplification of a 360 bp 
fragment using oligonucleotide primers (S'-CTGACCAACCTGGCAGACAG-S' (SEQ ID NO: 59); 5'- 
GG ACTCGGTGGAT ATGGTCC-3 * (SEQ ID NO: 60)) annealing 100 bp upstream and downstream of 
the mutagenized insert, respectively. The amplified fragments were purified on QIAquick spin columns 
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and subjected to automatic sequencing using either of two sequencing primers (5'- 

GTTCACATGATCAACTGG GCG-3* (SEQ ID NO: 61); 5 * -G AG ACTTC AGGGT GCTGGAC-3' (SEQ 
ID NO: 62) that annealed 70 bp from the mutagenized insert boundaries. 

Yeast protein extracts and Western blot analysis 

The Urea/SDS method was followed to prepare yeast protein extracts suitable to evaluate mutant 
protein expression by Western blot analysis using anti VP16 AD polyclonal antibodies (Santa Cruz). 

Quantitative fi-galcictosidase assays 

The assays were performed according to Clontech yeast protocol handbook. Single colonies were 
grown to saturation for 16 hrs at 30° C in -Trp SD minimal medium, cells were collected by 
centrifugation and then diluted in YPD medium to an optical density of 0.04 at 600 nM. Subcultures of 5 
ml-volume were set up and allow to grow for 7 hrs at 30° C in the presence of DMSO or various ligand 
concentrations until they reached mid-log phase (OD 60 o=0.4-0.5). 

Cells from 1.5 ml of culture (two duplicates for each sample) were pelleted, washed in lacZ 
buffer (10 mM KC1, 1 mM MgS0 4 , and 100 mM phosphate pH 7), and resuspended in 300 jjlI of lacZ 
buffer. 25-100 |ml of the suspension were lysed by three freeze/thaw cycles, mixed with 0.7 ml of lacZ 
buffer containing 50 mM P-mercaptoethanol and the enzymatic reaction was started by the addition of 
160 fil of a 4 mg/ml ONPG (Sigma) solution in lacZ buffer. 

The reaction was performed at 30° C until the yellow color developed and was stopped by the 
addition of 0.4 ml of 1 M Na 2 C0 3 , centrifuged and quantified by reading the optical density at 420 nm. 
Units of P-galactosidase were then defined as (1000 x OD 420 ) divided by [assay duration in min x (0.1 ml 
x concentration factor) x ODeooL Dose-response data were analyzed using a non-linear regression 
analysis with the aid of a Kaleidagraph software. 

Cell culture, trans fections and SEAP assays 

All cell culture experiments were performed using phenol red-free Dulbecco's modified Eagle* 
medium (DMEM, Gibco BRL) supplemented with 10% dextran charcoal-treated (Sigma) fetal bovine 
serum (FBS, GibcoBRL). Twenty-four hours prior transfection HeLa cells were seeded at a density of 
200,000 cells per well in 6-well plates. Transfections were performed using FuGENE 6 Transfection 
reagent (Roche) according to the manufacturer's instructions. 

The cells were transfected with 1 \ig of the reporter plasmid 5GAL4UAS-pSEAP, 0.1 \ig pCMV- 
Luc as internal control, variable amounts of pGBT9-GAL4DBD/ER LBD/VP16AD expression vectors 
and variable amounts of carrier pSEAP2-Basic DNA to normalize all samples to 2 \ig of total plasmid 
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DNA. DNA was mixed to 6 ^il of FuGENE Reagent diluted in 100 jil of serum-free mediumand added 
to cells. Seven hours later cells were provided with fresh medium and 24 hours after transfection the 
different ligands (or DMSO vehicle) were added at the indicated concentrations in fresh medium. 

Cells were then grown for 24 hours in the presence of the inducers and submitted to protein 
5 extraction for luciferase assays while supernatants were collected and processed for the detection of 
secreted alkaline phosphatase (SEAP). SEAP activity was evaluated using a commercially available 
assay (Tropix Phospha-Light system) following the manufacturer's guidelines. 

Reactions were performed in microplates and SEAP activity was measured as light emission 
using a microplate scintillation and luminescence counter (Top Count NXT, Packard). Values were 

10 subtracted of the background value obtained by measuring endogenous alkaline phosphatase in un- 
transfected cell medium. SEAP values were normalized for differences in the transfection efficiency, 
which was determined on the basis of luciferase activity. Then they were converted to SEAP 
concentration values (ng/ml) by comparison with standard activity curves obtained with purified human 
placental alkaline phosphatase (Sigma). 

15 To evaluate the effect of compounds on the m45.2 chimera transcriptional activity, HeLa cells 

were seeded 18 hrs before transfection in 6-well plate (3xl0 5 cells/well), and then transfected with 1 fig of 
plasmid DNA per well (0.5 jxg transactivator + 0.5 |Jig reporter) by using Lipofectamine (Gibco), 
according to manufacturer's instructions. 100 ng of the luciferase reporter plasmid were included in the 
transfection mixture as an internal control for transfection efficiency. At six hours after transfection, the 

20 culture medium was changed and cells were treated or not with the various ligands. After additional 24 
hours, the medium was harvested and analyzed for the expression of human SEAP, as described above. 
SEAP levels were normalized against the luciferase activity measured in cell extracts. Dose-response 
data were analyzed as outlined above. 
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Example 1 

Synthesis of 9a-Benzvl -7-Hvdroxv^-Methv1-l -2.9.9a-Tetrahvdro-3fe-Fluoren-3-One (CMP4^ 




1. EVK, NaOMe, 
MeOH, 60°C 

2. HOAc, 6N HCI, 
v 80°C 




Step 1: 5-methoxv-2-benzvlidene-l-indanone 

To a stirred suspension of 5-methoxyindanone (lg, 5.89 mmol) and benzaldehyde (0.79 g, 7.4 
mmol) in EtOH (6 mL) at room temperature was added KOH (0.6 g). After 20 h the formed precipitate 
was filtered off, washed with EtOH and dried under high vaccum to afford the title compound (1.41 g). 

1HNMR (DMSO-d* , 300 MHz) 5 3.90 (s, 3H), 4.10 (s, 2H), 7.04 (m, 1H), 7.12 (m, 1H), 7.42- 
7.58(m, 4H), 7.72-7.83 (m, 2H). 

Step 2: 5-methoxv-2-benzvl- 1 -indanone 

A suspension of the product from step 1 (777 mg) and palladium on charcoal (10%, 78 mg) in 
EtOAc was stirred under a hydrogen atmosphere (0.4 bar) at 40°C. After 3 h the catalyst was filtered off 
and the solution was concentrated to dryness under reduced pressure. After flash chromatography on 
silicagel of the crude product 554 mg of the tide compound were obtained as a white solid. 
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lH NMR CDMSO-d^ , 300 MHz) 8 2.60-2.78 (m, 2H), 2.93-3.18 (m, 3H), 3.82 (s, 3H), 6.92-6.98 
(m, 1H), 7.02 (m, 1H), 7.15-7.30 (m, 5H), 7.58 (d, J = 2.5 Hz, 1H). 

Step 3: 7-me1hoxv-9a-benzvl-L2.9^ 

5 To a solution of 5-methoxy-2-benzyl-l-indanone (0.7 g, 2.78 mmol) and ethyl vinyl ketone (293 

mg) in anhydrous methanol (5.5 mL) were added 0.97 mL of a 0.5 M solution of NaOMe in MeOH. The 
mixture was stirred and heated to 60°C for lh. It was then concentrated to dryness under vacuum and the 
residue was treated with a 1: 1-mixture of HOAc/6 N HC1 (30 mL) at 80°C for 3 h. After cooling to room 
temperature the mixture was diluted with EtOAc and washed with sat. NaHC0 3 . The organic phase was 
10 dried over Na 2 S0 4 , filtered and concentrated to dryness under vacuum to afford a yellow oil, which was 
chromatographied by flash chromato-graphy (silica gel, eluent CH. 2 Cy 2% EtOAc). The collected 
product fractions were concentrated to dryness under vacuum and the residue was triturated with Et20/ 
petroleum ether to afford 339 mg of the title compound as a white solid. 

*H NMR (CDC1 3 , 400 MHz) 5 1.90-2.02 (m, 1H), 2.15 (s, 3H), 2.16-2.25 (m, 1H), 2.50-2.62 (m, 

15 3H), 2.77-2.93 (m, 2H), 3.08 (d, J=16.1 Hz, 1H), 3.88 (s, 3H), 6.83-6.90 (m, 2H), 7.05-7.12 (m, 2H), 
7.19-7.30 (m, 3H), 7.67 (d, J= 9.3Hz, 1H). 

Step 4: 7-hvdroxv-9a-benzvl-1.2,9.9a-tetrahvdro-3ff-fluoren-3-on 

To a solution of 7-methoxy-9a-benzyH,2,9,9a-tetrahydro-3i/-fluoren-3-one (279 mg, 0.877 
20 mmol) in CH2CL2 (10 mL) at -78°C was added 1M BBr 3 in CH2CI2 (2.6 mL, 2.6 mmol). The cooling 
bath was removed and the solution was stirred at room temperature for 2 hours. The solution was diluted 
with EtOAc (20 mL), washed with IN HC1 (20 mL), dried (Na 2 S0 4 ), filtered and concentrated under 
reduced pressure. After flash chromatography on silicagel of the crude product 190 mg of the title 
compound were obtained as a pale yellow solid. 

25 lH NMR (DMSO-d 6 , 400 MHz) 5 1.85-2.07 (m, 5H), 2.80-2.90 (m, 1H), 2.45 (m, 2H, in part 

overlaid with DMSO-signal), 2.75-3.00 (m, 3H), 6.70-6.80 (m, 2H), 7.05-7.12 (m, 2H), 7.15-7.28 (m, 
3H), 7.57 (d, J=8.2Hz, 1H). 
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Example 2 

Synthesis of 9a-(4-CMorobenzvlV7-H^ 

Tetrahvdro-3ft-Huoren-3-One (CMP8) 




p-Chloro- 
benzaldehyde 

KOH, EtOH 





1. MVK, NaOMe, 
MeOH, 60°C 

2. HOAc, 6N HCI, 
80°C 



BBr3, CH2CI2 
-78°C to r.t. 



MOMO- 




SnBu 3 



Pd(PPh 3 ) 4 , PhMe 



MOMQ 





1. HCI, MeOH, 60°C 



.O 2. 




/ 



N- 



DIAD, PPh 3 , THF 
3. AICI 3 , HSZ-Prop 
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Step 1: 2-(4^MorobenzrvUdeneV5-methoxv-l-indanone 

To a stirred suspension of 5-methoxyindanone (1.53 g, 9.01 mmol) and 4-chlorobenzaldehyde (2 
g, 14.2 mmol) in EtOH (40 mL) at room temperature was added a 0.5 M solution of sodium methanolate 
5 in methanol (5.6 mL). After 3 h the formed precipitate was filtered off, washed with EtOH and dried 
under high vaccum to afford the title compound 2.37 g of the title compound as a light brown solid. 

!h NMR (DMSO-d 6 . 300 MHz) 5 3.88 (s, 3H), 4.07 (s, 2H), 7.03 (m, 1H), 7.17 (m, 1H), 7.44 
(m, 1H), 7.55 (m, 2H), 7.70-7.82 (m, 3H). 

10 Step 2: 2-(4-chloroben2rvlV5-methoxv-l--indanone 

To a suspension of selenium (180 mg, 2.3 mmol) in anhydrous ethanol (5 mL) was added sodium 
boronhydride (96 mg, 2.6 mmol) at 0°C. The suspension was stirred for 20 minutes at 0°C and the 
formed clear solution was added to a suspension of 2-(4-cMorobenzylidene)-5-methoxy-l-indanone 
(451 mg, 1.57 mmol) in anhydrous tetrahydrofurane under a nitrogen atmosphere. The mixture was 

15 stirred and heated to 50°C for 2 hours. After cooling to room temperature the mixture was partitioned 
between 1 M KH 2 P0 4 (150 mL) and EtOAc. The organic phase was filtered, dried over Na 2 S0 4 and 
filtered again. After concentration to dryness under reduced pressure 428 mg of the title compound as a 
brown oil were obtained. 

*H NMR (CDC1 3 , 400 MHz) 8 2.65-2.80 (m, 2H), 2.91-3.00 (m, 1H), 3.12 (m, 1H), 3.32 (m, 
20 1H), 3.86 (s, 3H), 6.73 (s, 1H), 7.91 (m, 1H), 7.13-7.30 (m, 4H), 7.71 (d, J=7.9 Hz, 1H). MS: m/z 
(M+H*): 287.0. 

Step 3: 9a-f4-chlorobeazvlV7-methoxv-L2>9>9a-tetrahvdro-3g-fluoren-3-one 

To a solution of 2-(4-chlorobenzyl)-5-methoxy- 1 -indanone (8.27 mmol) in 50 mL anhydrous 

25 tetrahydrofurane under a nitrogen atmosphere were added methyl vinyl ketone (1.345 g, 19.2 mmol) and 
DBU (0.285 mL, 1.9 mmol). The mixture was stirred at 55°C for 20 hours. After cooling to room 
temperature the mixture was partitioned between 1 M KH 2 P0 4 and EtOAc. The organic phase was dried 
over Na 2 S0 4 and filtered. After concentration to dryness under reduced pressure 2.66 g of a crude oil 
were obtained, which was redissolved in 4 mL of anhydrous tetrahydrofurane. Pyrrolidine (0.62 mL, 

30 7.4 mmol) and acetic acid (0.43 mL, 7.5 mmol) were added and the mixture stirred and heated to 55°C for 
4 hours. After cooling to room temperature the mixture was partitioned between 1 M KH 2 P0 4 and 
EtOAc. The organic phase was washed with 1 M Na2HP0 4 , dried over Na 2 S0 4 and filtered. After 
concentration to dryness under reduced pressure 2.11 g of the crude title compound were obtained as an 
oil and was used without further purification in the next step. 

35 MS: m/z (M+H 1 "): 339.0. 
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Step 4: 4-bromo-9a^4<:MorobenzvlV7"methoxv-L2,9.9a'telTahvdro>3Jy-fluoren-3-one 
A solution of crude 9a-(4-cMorobenzyl)-7-methoxy-l,2,9,9a-teti^ 

(2.11 g, ca. 6.2 mmol) in CCI4 (12 mL) was treated with solid NaHC03 (2.6 g, 31 mmol). The mixture 

was cooled in an ice bath and rapidly stirred while bromine (0.322 mL, 6.3 mmol) was added over 6 
minutes. After stirring for 1.5 hours at 0°C, the mixture was partitioned between CH2CI2 (300 mL) and 

water (300 mL). The aqueous phase was extracted with CH2CI2 (100 mL) and the combined organic 

phases were dried over Na 2 SC>4, filtered, and concentrated to dryness under reduced pressure. After flash 

chromatography on silicagel of the crude product (eluent: petrolether/ EtOAc, 5:1) 683 mg of the title 
compound were obtained as a pale yellow solid. 

*H NMR (CDCI3, 400 MHz) 8 2.05-2.25 (m, 2H), 2.63-3. 10 (m, 6H), 3.89 (s, 3H), 6.78 (s,lH), 

6.85-7.22 (m, 5H), 8.45 (d, J=8.6Hz, 1H). 
MS: m/z (M+lT): 416.6/419.0. 

Step 5: 9a-(4-chlorobenzvl)-7-methoxv-4-(4-methoxvmethoxv-phenvl)-l ,2.9.9a-tetrahydro-3/Z-fluoren- 
3-one 

A mixture of 4-bromcK9a^4-cMorobenzyl)-7-meth^ 
(683 mg, 1.64 mmol), Pd(PPht3)4 (900 mg, 0.777 mmol), and tributyl-(4-methoxymethoxy-phenyl)- 

stannane (861 mg, 2.02 mmol) in anhydrous toluene (12 mL) was placed under a nitrogen atmosphere and 
heated with stirring in an oil bath at 100°C. After 4 days, the mixture was cooled to room temperature 
and evaporated under vacuum to a dark oil (2.208 g). This material was purified by flash chromatography 
on silica gel, using 4: 1 petroleum ether-EtOAc as elutent, to afford 9a-(4-chlorobenzyl>7-methoxy-4«(4- 
methoxymethoxy-phenyO-l^^^a-tetrahydro-SH-fluoren-S-one (140 mg) as a yellow oil. 

*H NMR (CDCI3, 400 MHz) 6 2.12-2.31 (m, 2H), 2.63-2.80 (m, 3H), 2.82-2.96 (m, 1H), 2.98- 

3.10 (m, 2H), 3.55 (s, 3H), 3.80 (s, 3H), 5.25 (m, 2H), 6.40 (m, 1H), 6.49 (m, 1H), 6.73 (s, 1H), 7.02-7.70 
(m,8H). 

Step 6: 9a-f4-cMorobenzrvl)-4^4-hvdr^ 

A solution of 9aK4-chloroben2yl)-7-methoxy-4-(4-methoxy^ 
3//-fluoren-3-one (140 mg, 0.295 mmol) in methanol (12 mL) was warmed in an oil bath at 60°C and 
treated with aqueous 2N HC1 (2.5 mL). The resulting mixture was stdired and heated at 60°C for two 
hours, then cooled to room temperature and concetrated under reduced pressure. The residue was 
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partitioned between EtOAc and 1M Na 2 HP0 4 . The organic phase was dried over Na 2 S0 4 , filtered and 
concentrated under vacuum to leave 120 mg of the title compound as a yellow oil. 

*H NMR (CDCI3, 400 MHz) 5 2.12-2.23 (m, 1H), 2.24-2.33 (m, 1H), 2.62-2.80 (m, 3H), 2.83- 

3.08 (m, 3H), 3.78 (s, 3H), 6.40-6.51 (m, 2H), 6.72 (s, 1H), 6.80-7.00 (m, 4H), 7.05 (d, J=8.8 Hz, 2H), 
5 7.23(d,d,J=8.8Hz,2H). 

Step 7: 9a-(4-cMorobenzrvD-4-r4-f2-piperidine-l-vlethoxv)phenvl1-7-methoxv-L 
fluoren-3-one 

To a solution of 9a-(4^Morobenzyl)-4-(4-hydroxyphenyl)-7-methoxy-l,2,9,9a-tetxahy 
10 fluoren-3-one (120 mg, 0.28 mmol), triphenylphosphine (220 mg, 3 eq.) and piperidineethanol (1 12 yL, 
3 eq.) in 1.6 mL of anhydrous tetrahydrofurane at 0°C was added dropwise a solution of diisopropyl- 
azodicarboxylate in anhydrous tetrahydrofurane (175 |iL, 3 eq., in 600 pL) at 0°C. The solution was 
stirred for 1 h at 0°C and for 6 h at room temperature. The product was purified by flash chromatography 
on silica gel, using 2:1 EtOAc-MeOH as elutent. After removal of the solvents under reduced pressure 
15 96 mg of the title compound were obtained as a yellow oil. 

*H NMR (DMSO-ck , 400 MHz) 5 1.30-1.55 (m, 6H), 2.14 (m, 2H), 2.45 (m, 4H, in part covered 
by solvent signal), 2.67 (m, 4H), 2.73 (m, 1H), 2.86 (m, 1H), 3.02 (m, 2H), 3.70 (s, 3H), 4.10 (t, 
J=6.5 Hz, 2H), 6.21 (d, J=8.0 Hz, 1H), 6.50 (m, 1H), 6.82 (s, 1H), 6.95 (s, br, 4H), 7.15 (d, J=7.0 Hz, 2H), 
7.25 (d,J=7.0Hz,2H). 

20 

Step 8: 9a-(4-cMorobenzvD^- l4-r2-(l-piperidinvnethoxv1phenvll-7-hv^^ 
fluoren-3-one 

A solution of 9a<4-cWorobenzyl)-7-methoxy^{4-[2-(l-piperidinyl)-ethoxy]phenyl}-l,2,9,9a- 
tetrahydro-3//-fluoren-3~one (96 mg, 0.177 mmol) in anhydrous CH2CI2 (4 mL) was placed under a 

25 nitrogen atmosphere and cooled to 0°C. By a syringe a solution of A1C1 3 (215 mg, 1.61 mmol) in 2- 
propanethiol (2 mL) was added. The resulting mixture was stirred for 3 h at room. The solvents were 
removed by a stream of nitrogen and MeOH (1 mL) was added to the residue. The resulting mixture was 
partitioned between saturated aqueous NaHC0 3 and EtOAc. The organic phase was dried over NaS0 4 , 
filtered and evaporated under vacuum to afford 9a-(4-chlorobenzyl)-7-hydroxy-4-{4-[2-(l- 

30 piperidinyl)ethoxy]phenyl}--l^,9,9a-tetrahydro-3/f-fluoren-3-one as a yellow solid. 

*H NMR (DMSO-do , 400 MHz) 8 1.33- L42 (m, 2H), 1.45-1.55 (m, 6H), 2.12 (m, 2H), 2.45 (m, 
4H, in part covered by solvent signal), 2.60-2.70 (m, 4H), 2.73 (m, 1H), 2.85 (m, 1H), 2.97 (d, J=15 Hz, 
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2H), 4.10 (t, J=6.5 Hz, 2H), 6.12 (d, J=8.0 Hz, 1H), 6.30 (m, 1H), 6.61 (s, 1H), 6.95 (s, br, 4H), 7.14 (d, 
J=7.0 Hz, 2H), 7.24 (d, J=7.0 Hz, 2H). 
MSim/zCM+H^): 528.2. 

5 Example 3 

Synthesis of (3gV9a-Benzvl- 7-Hvdroxv-4-Methvl-12^ Mftthnxime 




10 A solution of 9a-benzyl-7-hydroxy-4-methyl-l,2,9,9a-te^ (50 mg, 0. 16 

mmol, prepared as described in example I) and O-methylhydroxylamine hydrochloride (27 mg, 
0.32 mmol) in a 1: 1-mixture of EtOH and anhydrous pyridine (0.5 mL) was stirred at room temperature 
for 16 hours. The reaction mixture was diluted with CH 2 C1 2 , washed with IN HC1 and brine, dried over 
Na 2 S04, filtered, and evaporated under vacuum. The title compound was obtained as a colourless solid 

15 (25 mg)... 

lH NMR (DMSOndg, 400 MHz) 5 1.42-1.56 (m, 1H), 1.92 (m,lH), 2.09 (s, 3H), 2.34 (m, 2H), 2.53-2.70 
(m, 2H), 2.78-2.90 (m, 2H), 3.87 (s, 3H), 6.66 (m, 2H), 7.04 (m, 2H), 7.14-7.26 (m, 3H), 7.43 (d, 
J=8.3Hz, 1H). 
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Example 4 

Synthesis of 6-Methv1-9a ^4-Huorobenzvn -8,9.9a. 10-Tetrahvdroindenor2.1^Iiidazol-7r3/iVOne 




F 



NO + BF 4 -, CH 2 CI 2 

KOAc, dibenzo-18-crowi>6 




F 



-62- 



WO 2005/040212 



PCT7EP2004/011683 



Step 1: 5-fac ety1aniino )^-bromo-l-indanone 

A suspension of N-(l-oxo-23-dihydro-lJ?-inden-5-yl)acetamide (1.50 g, 7.93 mmol) and Af- 
bromosuccinimide (1.47 g, 8.26 mmol) in 8 mL acetonitrile was stirred and heated to 60°C for 7 h. After 
5 cooling to room temperature a formed precipitate was filtered off and dried under vacuum to afford the 
title compound (1.28 g). 

l H-NMR (DMSO-d 6 , 400 MHz): 8 2.16 (s, 3H), 2.68 (m, 2H), 3.01 (m, 2H), 7.61 (d, J=8.0 Hz, 
1H), 7.83 (d, J=8.0 Hz, 1H), 9.62 (s, 1H). 

10 Step 2: 5-( acetvlamin o)-4>methvM-indanone 

A suspension of 5-(ac^tylamino)-4-bromo-l-indanone (8.34 g, 31.1 mmol), 
dichlorobis(tripenylphosphine)palladium(II) (1.09 g, 1.38 mmol), tetramethyltin (4.3 mL, 1 eq.), 
triphenylphosphine (0.82 g, 1 eq.) and lithium chloride (2.63 g, 2 eq.) in 83 mL of anhydrous 
dimethylformamaide was stirred and heated to 100°C under a nitrogen atmosphere for 6 h. After cooling 

15 to room temperature the mixture was filtered over celite and concentrated under reduced pressure. The 
residue was dissolved in CH 2 Cl 2 -5%MeOH and the solution was washed with brine. After concentration 
under reduced pressure the residue was triturated with diethyl ether and left under high vacuum to afford 
4.6 g of the title compound. 

'H-NMR (DMSO-d*, 400 MHz): 8 2.11 (s, 3H), 2.20 (s, 3H), 2.62 (m, 2H), 3.00 (m, 2H), 7.43 (d, 
20 J=8.2 Hz, 1H), 7.59 (d, J=8.2 Hz, 1H), 9.53 (s, 1H). 

Step 3: 5-(acetv1aminn)-?- f4-fluoro 

To a solution of 5-<acetylamino)-4-methyl-l-indanone (0.5 g, 2.46 mmol) and 4- 
fluorobenzaldehyde (1.2 eq.) in 10 mL of anhydrous EtOH were added dropwise under stirring 1.5 mL of 
25 0.5M NaOMe/MeOH. The mixture was stirred for 16 h at room temperature and the formed precipitate 
was filtered off and washed with EtOH. After drying under vacuum 606 mg of the title compound were 
obtained. 

'H-NMR (DMSO-ds, 400 MHz): 8 2.12 (s, 3H), 2.30 (s, 3H), 4.03 (s, 2H), 7.33 (m, 2H), 7.50 (s, 
1H), 7.59 (d, J=8.0 Hz, 1H), 7.67 (d, J=8.0 Hz, 1H ), 7.89 (m, 2H), 9.58 (s, 1H). 
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Step 4: 5-face ty!aTnmn )-2-f4-fluorobenzvD^methvl-l-indanone 

A suspension of the product from step 3 (651 mg, 2.1 mol) and palladium on charcoal (10%, 150 

mg) in 60 mL EtOAc was stirred under a hydrogen atmosphere at 40°C. After 1.5 h the catalyst was 

filtered off and the solution was concentrated to dryness under reduced pressure to afford the titile 

5 compound (438 mg). 

*H-NMR (DMSO-de, 300 MHz): 8 2.09 (s, 3H), 2.12 (s, 3H), 2.60-2.75 (m, 2H), 2.95-3.21 (m, 
3H), 7.11 (m, 2H), 7.31 (m, 2H), 7.45 (d, J=8.1 Hz, 1H), 7.61 (d, J=8.1 Hz, 1H ), 9.52 (s, 1H). 

Stegjx 7-flmm o-9a-(4-fluorobenzvn^,8Ki^ 

10 

To a stirred suspension of 5-(acetylamino)-2-(4-fluorobenzyl)-4-methyl-l-indanone (438 mg, 
1.41 mmol) and ethyl vinyl ketone (1.25 eq.) in 2.8 mL anhydrous methanol were added dropwise a 
solution of sodium methoxide (0.5 M) in methanol (0.5 mL). The mixture was stirred and heated to 60°C 
for 3 hours. After cooling to room temperature the mixture was concentrated under reduced pressure. 

15 The residue was dilutesd with ethyl acetate and washed with water and brine. The organic phase was 
dried over Na 2 S0 4 , filtered and concentrated under reduced pressure. An oily material was obtained, 
which was dissolved in a mixture of 10 mL HOAc and 10 mL 6 N HC1. The mixture stirred and heated to 
80°C for 3 h. After cooling to room temperature the mixture was neutralized with saturated aq. NaHC0 3 
and extracted with ethyl acetate. The organic phase was dried over Na 2 S0 4 , filtered and concentrated 

20 under reduced pressure to obtain the title compound as a yellow foam ( 198 mg). 

'H-NMR (CDC1 3 , 300 MHz): 8 1.94-2.03 (m, 1H), 2.07 (s, 3H), 2.12 (s, 3H), 2.14-2.23 (m, 1H), 
2.43 (d, J=15.9 Hz, 1H), 2.50-2.69 (m, 2H), 2.72-2.83 (m, 1H), 2.85 (d, J=13.8 Hz, 1H), 3.02 (d, J= 
15.9 Hz, 1H), 3.94 (s, br, 2H), 6.63 (d, J=8.2 Hz, 1H), 6.86-7.04 (m, 4H), 7.43 (d, J=8.2 Hz, 1H). 

25 Step 6: 6-methvl-9a-(4-fluorobenzvn -8.9.9a, 10-tetrahvdroindenor2.1-g1indazol-7(3Jin-one 
To a solution of 7-amincn9a-(4-fluorobenzyl)^,8-dim^ 
(196 mg, 0.59 mmol) in CH 2 C1 2 (5 mL) at -35°C was added nitrosium tetrafluoroborate (69 mg, 
0.59 mmol). The mixture was stirred for one hour during which the temperature was left reaching 4°C. 
The mixture was again cooled to -35 °C, diluted with CH 2 CI 2 (5 mL) and potassium acetate (122 mg, 

30 1.24 mmol) and dibenzo-1 8-crown-6 (8 mg, 22 jimol) were added. The cooling bath was removed and the 
mixture was stirred for 1 hour at room temperature. The mixture was then partitioned between CH 2 C1 2 
and water. The organic phase was washed with brine, dried over Na 2 S0 4 , filtered and concentrated under 
reduced pressure. The crude product was purified by flash chromatography on silicagel (eluent: CH2CI2/ 
EtOAc 9:1, v/v), The title compound was obtained as an orange solid (78 mg). 

35 MS: m/z 347.0 for [M+H] + 
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'H-NMR (DMSO-d* 30O MHz): 5 2.07 (s, 3H), 2.1-2.2 (m, 2H), 2.33-2.45 (m, 1H), 2.63 (d, 
J=13.4 Hz, 1H), 2.72-2.95 (m, 3H), 3.32 (d, 1H, in part overlaid by water signal), 6.83-7.12 (m, 4H), 7.46 
(d, J=8.7 Hz, 1H), 7.69 (d, J=8.7 Hz, 1H), 8.15 (s, 1H), 13.27 (s, 1H). 
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Example 5 

Synthesis of 9a-Benzvl-6-i442-fl-Pip eri di^ 

E1Indazol-7(3feyOne Hvdrotrifluoroacetae Salt (CMP9^ 

5 




1 . EVK, NaOMe, 
MeOH, 60°C 

2. pyrrolidine.HOAc, 
PhMe, THF, 100°C 



AcNH 




NaOMe, MeOH, 
EtOH 
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Step 1: 5-fAc& ty1amin nV2-benzvli(ieiie-4-methvl-l-indanone 
5 To a suspension of 5^ac^tylamino)-4-methyl-l-indanone (2.0 g, 9.84 mmol), prepared as 

described in example 5, and benzaldehyde (1 1.9 mmol) in 40 mL of anhydrous EtOH were added 
dropwise under stirring 6.0 mL of 0.5M NaOMe/MeOH. The mixture was stirred for 6 h at room 
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temperature and the formed precipitate was filtered off and washed with EtOH. After drying under 
vacuum 2.11 g of the title compound as ayelow solid were obtained. 

'H-NMR (DMSO-d* 400 MHz): 5 2.14 (s, 3H), 2.30 (s, 3H), 4.32 (m, 2H), 7.42 (m, 4H), 7.60 (d, 
J=8.0 Hz,lH), 7.68 (d, J=8.0 Hz, 1H), 7.73 (m, 2H), 9.58 (s, 1H). 

5 

Step 2: 5-f AceWlaminoV2-benzvI--4-methvl-l-indanone 

A suspension of the product from step 3 (2.11 g, 7.25 mol) and palladium on charcoal (10%, 200 
mg) in a mixture of 40 mL EtOAc and 40 mL MeOH was stirred under a hydrogen atmosphere at 40°C. 
After 1.5 h the catalyst was filtered off and the solution was concentrated to dryness under reduced 
10 pressure. After flash chromatography on silicagel (eluent: CH 2 Cy EtOAc 9: 1, v/v) the tide compound 
was obtained as a white solid (1.64 g). 

'H-NMR (DMSO-d 6> 400 N4Hz): 5 2.10 (s, 3H), 2.12 (s, 3H), 2.60-2.75 (m, 2H), 2.97-3.12 (m, 
2H), 3.13-3.22 (m, m), 7.15-7.32 Cm, 5H), 7.46 (d, J=7.8 Hz, 1H), 7.60 (d, J=7.8 Hz, 1H ), 9.52 (s, 1H). 

15 Step 3: 7-(AceWlammoV9a-benzyl-^^ 

To a stirred solution of 5-(acetylamino)-2-benzyl-4-methyl-l-indanone (1.64 g, 5.60 mmol) and 
methyl vinyl ketone (7.06 mmol) in 14 mL anhydrous methanol were added dropwise a solution of 
sodium methoxide in methanol (0.5 M, 2.3 mL). The mixture was stirred at room temperature for 19 
hours. It was then concentrated under reduced pressure and the residue partitioned between ethyl acetate 

20 and brine. The aqueous phase was extracted with ethyl aceate and the combined organic phases were 
dried over Na 2 S0 4 , filtered and concentrated under reduced pressure. An oily material was obtained, 
which was dissolved in a mixture of 51 mL THF, 4.4 mL toluene, 0.5 mL pyrrolidine and 0.27 mL 
HOAc. The mixture stirred and heated to 100°C for 75 min. After cooling to room temperature the 
mixture was concentrated under reduced pressure. After flash chromatography on silicagel (eluent: 

25 CH2CI2/ EtOAc 8:2, v/v) 1 . 1 8 g of the tide compound was obtained. 

'H-NMR (DMSO-d* 400 MHz): 8 1.92-2.03 (m, 1H), 2.05-2.13 (m, 6H), 2.13-2.19 (m, 1H), 
2.38-2.63 (m, 3H, in part under solvent signal), 2.73-2.86 (m, 1H), 2.93 (d, J=13.3 Hz, 1H), 3.10 (d, 
J=15.7 Hz, 1H), 6.28 (s, 1H), 7.10-7.26 (m, 5H), 7.37-7.53 (m. 2H), 9.39 (s, 1H). 
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Step 4: 7-(Acetvlamino)-4-bromo-9a-benzvl-8-me 



To a stirred suspension of 7-anrino-9a-ben2yl-8-methy^ 
(421 mg, 1.2 mmol) and NaHC0 3 (500 mg) in CH 2 C1 2 (11 mL) at 0°C was added dropwise a solution of 
5 bromine (1 eq.). The mixture was stirred for addirional 15 min at 0°C and then diluted with water 

(40 mL). The organic phase was separated, dried over Na 2 S0 4 and concentrated under reduced pressure 
to afford the title compound (595 mg). 

^-NMR (DMSO-d 6 , 400 MHz): 8 2.02-2.20 (m, 8H), 2.55-2.67 (m, 3H), 2.93-3.13 (m, 3H), 
7.03-7.22 (m, 5H), 7.55 (d, J=7.3 Hz, 1H), 8.18 (d, J=7.3 Hz, 1H), 9.42 (s, 1H). 

10 

Step 5: 7-amin o^-bromo-9a-benzvl-8-methvM 

A solution of 7-(acetylamino)-4-bromo-9a-benzyl-8-methyl-l,2,9,9a-tetrahy 
one (1.56 g, 3.41 mmol) in EtOH (22 mL) and sodium methoxide in methanol (0.5 M, 21 mL) under 
nitrogen was stirred and heated to 80°C for 6.5 h. 

15 After cooling to room temperature the mixture was partitioned between ethyla cetate and brine. 

The organic phase was dried over Na 2 S04, filtered and concentrated under reduced pressure to obtain the 
title compound as a yellow foam (983 mg). 

! H-NMR (DMSO-d 6 , 400 MHz): 5 1.95-2.10 (m, 5H), 2.40-2.60 (m, 3H, in part under solvent signal), 
20 2.92-3.09 (m, 3H), 5.89 (s, br, 2H), 6.60 (d, J=7.9 Hz, 1H), 7.03-7.26 (m, 5H), 8.02 (d, J=7.9 Hz, 1H). 

Step 6: 6-bromo-9a-benzvl >8.9.9aJ0-tetrahvdroindenor2.1-e1indazol-7f3/f)-one 
To a solution of 7-amino-4-bromo-9a-benzyl-8-methyl-^ 

(983 mg, 2.57 mmol) in CH 2 C1 2 (14 mL) at -35°C was added nitrosonium tetrafluorobbrate (304 mg, 
25 2.60 mmol). Stirring was continued for 70 min while the temperature was allowed to reach 0°C. The 

solution was cooled again to -35°C and KOAc (509 mg, 5.2 mmol) and dibenzo-18-crown-6 (48 mg) 

were added. The cooling bath was removed and the stirring was continued for 2 h at room temperature. 

The solution was diluted with CH 2 C1 2 (100 mL) and washed with water and brine. The organic phase was 

dried over Na 2 S0 4 , filtered and concentrated under reduced pressure. After flash chromatography on 
30 silicagel (eluent: CH 2 Cy EtOAc 9: 1, v/v) 353 mg of the tide compound were obtained as a brown-red 

solid. 

'H-NMR (DMSO-de, 300 MHz): 8 2.1-2.3 (m, 2H), 2.67-2.78 (m, 2H), 2.90-3.20 (m, 3H), 3.45 
(d, J=16.7 Hz, 1H),6.90-7.15 (m, 5H), 7.50 (d, J=9.3 Hz, 1H), 8.11 (s, 1H), 8.39 (d, J=9.3 Hz, 1H), 13.40 
(s, 1H). 

35 
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Step 7: 6-bromo-9a-benzvl-3 4^^ 
7f3fl)-one 

A solution of 6-bromo-9a-benzyl -S^^aJO-tetrahydroindeno^J-^indazol-TCS/O-one (353 mg, 
0.90 mmol), 4-(dimethylamino)-pyridine (165 mg, 1.35 mmol) and p-toluenesulfonyl chloride (207 mg, 
5 1.09 mmol) in 5 mL anhydrous CH 2 C1 2 was stirred under nitrogen for 1 h at 0°C and 2 h at room 
temperature. After dilution with CH 2 C1 2 the sultion was washed sequentially with water, phosphate 
buffer (1M, pH 3), 5% aq. NaHC0 3 and brine. The organic phase was dried over Na 2 S0 4 , filtered and 
concentrated under reduced pressure. After flash chromatography on silicagel (eluent: petroleum ether/ 
EtOAc 7:3, v/v) 156 mg of the title compound were obtained. (The 2-[(4-methylphenyl)sulfonyl]-isomer 
10 (200mg) was eluted before the title compound). 

! H-NMR (CDC1 3 , 400 MHz): 8 2.14-2.25 (m, 1H), 2.30-2.48 (m, 1H), 2.41 (s, 3H), 2.74-3.10 (m, 
5H), 3.40 (d, J=16.8 Hz, 1H), 6.90-7.13 (m, 5H), 7.31 (d, J=8.2 Hz, 1H), 7.92 (d, J=8.2 Hz, 1H), 8.15 (s, 
1H), 8.18 (d, J=9.1, 1H), 8.72 (d, J=9.1, 1H). 

15 Step 8: 9a-benzvl-6-r4-(m e&oxvmethox 

tetrahvdroindeno-r2.1-g1indazol-7(3j ! if)-one 

A solution of 6-bromo-9a-benzyl-3-[(4-me 

e]indazol-7(3//)-one (156 mg, 0.34 mmol), Pd(PPh 3 ) 4 (18 mg, 15.5 fimol), and tributyl-(4- 

methoxymethoxy-phenyl)-stannane (160 nag, 0.37 mmol) in anhydrous toluene (2.5 mL) was purged with 
20 nitrogen and heated with stirring to 100°C under nitrogen for 24 h. The mixture was cooled to room 

temperature and concentrated under vacuum. After flash chromatography on silicagel (eluent: petroleum 

ether/ EtOAc 1:1, v/v) 137 mg of the title compound were obtained. 

'H-NMR (CDCI3, 400 MHz): 5 2.20-2.31 (m, 1H), 2.35-2.45 (m, 4H), 2.69-2.77 (m, 1H), 2.80- 

2.88 (m, 2H), 2.91-3.08 (m, 2H), 3.40 (d, J=16.8 Hz, 1H), 3.60 (s, 3H), 5.29 (m, 2H), 6.61 (d, J=8.8 Hz, 
25 1H), 6.90-7.30 (m, 11H), 7.74 (d, J=8.8 Hz, 1H), 7.87 (d, J=8.2 Hz, 2H), 8.12 (s, 1H). 
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Step 9: 9a-benzyl-6-(4-hvdr^^ 
e1indazol-7(3flVone 

A solution of 9a-benzyl-6-[4-(methoxymethoxy^ 
5 tetrahydroiiideno-[2,l-e]indazol-7(3i?)-one (137 mg, 0.226 mmol) in methanol (5.2 mL) and aqueous 2N 
HC1 (0.52 mL)was stirred and heated to 80°C for 1 h. After cooling to room temperature and the mixture 
was concetrated under reduced pressure. The residue was partitioned between EtOAc and water and the 
organic phase was washed with brine. It was then dried over Na 2 S0 4 , filtered and concentrated under 
vacuum to leave 124 mg of the title compound as a yellow foam. 
10 lH NMR (CDCI3, 400 MHz) 5 2.20-2.30 (m, 1H), 2.34-2.45 (m, 4H), 2.69-2.77 (m, 1H), 2.79- 

2.87 (m, 2H), 2.91-3.08 (m, 2H), 3.40 (d, J=16.8 Hz, 1H), 6.61 (d, J=8.8 Hz, 1H), 6.80-7.32 (m, 11H), 
7.73 (d, J=8.8 Hz, 1H), 7.85 (d, J=8.2 Hz, 2H), 8.11 (s, 1H). 

Step 10: 9a-benzvl-6-M-r2-Q-pjperid ^ 

15 7(3H)one hvdrotrifluoroacetate salt 

To a solution of 9a-benzyl-6-(4-hydroxyphenyl)-3-[(4-methylphenyl)sulfonyl]-8,9,9a,10- 
tetrahydroindeno-[2,l-e]indazol-7(3J c 0-one (124 mg, 0.22 mmol), triphenylphosphine (175 mg, 
0.66 mmol) and piperidineethanol (88 \jlL 7 0.22 mmol) in 2.0 mL of anhydrous tetrahydrofurane at 0°C 
under nitrogen was added dropwise a solution of diisopropyl-azodicarboxylate (140 jiL, 0.72 mmol) in 

20 anhydrous tetrahydrofurane (500 jjL). The solution was stirred for 1 h at 0°C and for 5 h at room 
temperature. The product was purified by flash chromatography on silica gel, using CH 2 Cl 2 -5 vol% 
MeOH as eluent After removal of the solvents under reduced pressure the crude (4- 
methylphenyl)sulfonyl-protected intermediate was obtained as a yellow foam, which was dissolved in a 
mixture of 2 mL EtOH, 2 mL 1,4-dioxane and 1 mL IN aqueous NaOH. The mixture was stirred for 3 h 

25 at room temperature, acidified with HOAc and the siolvents removed under reduced pressure. The 

residue was partitioned between CH 2 C1 2 and brine. The organic phase was dried over Na 2 S0 4 , filtered 
and concentrated under vacuum. The crude product was purified by preparative RP-HPLC, using water 
(0.1 % TFA) and acetonitrile (0.1 % TFA) as eluents (column: Waters SymmetryPrep C18, 
22 x 100 nun). The pooled product fractions were lyophilized to afford 25 mg of the title compound. 

30 MS: m/z 518.2 for [M+H] + 

'H-NMR (DMSO-d*, 400 MHz): 8 1.33-1.50 (m, 1H), 1.64-1.90 (m, 5H)„ 2.18-2.30 (m, 2H), 
2.40-2.50 (m, 1H, in part under solvent signal), 2.79 (d, J=12.8 Hz, 1H), 2.83-3.10 (m, 6H), 3.50-3.62 (m, 
4H, in part under water signal), 4.41 (m, 2H), 6.30 (d, J=8.4 Hz, 1H), 6.94-7.25 (m, 10H), 8.13 (s, 1H), 
9.38 (s, br, 1H). 
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Example 6 

Synthesis of 9a-(4-Huorobenzvn ^-MeA^ 

One fL884.653) 

5 




Cr0 3 , HOAc 
15-17°C 



Rr if MeOH, 6N HCI /? 

N0 2 H NO a 



Pd(C), Ha, KOAc, 
EtOAc 




F 



-72- 




WO 2005/040212 



PCT/EP2004/011683 



Step 1: 5-acetamido-indane 

A solution of 5-aminoindane (30g, 225.2 mmol) in acetic acid (75 mL) and acetic anhydride (55 
mL) was refluxed for 1.5 h and the solvents were distilled off under reduced pressure. A brown oil 
remained which solidified upon standing at room temperature. The crude material (39.8g) was dissolved 
5 in methanol (100 mL) and 1M aqueous K 2 C0 3 (35 mL) was added. The solution was left standing at 

room temperature for 30 minutes and then partitioned between CH 2 C1 2 (500 ml) and water (400 mL). The 
organic phase was dried over Na 2 S0 4 , filtered and concentrated under reduced pressure to afford 33.5g of 
the titile compound as a light brown solid. 

'H-NMR (CDCI 3 , 400MHz): 8 7.45 (s, 1H), 7.15 (s, 3H), 2.88 (m, 4H), 2.17 (s, 3H), 2.08 (m, 

10 2H). 

Step 2: 6-Bromo-5 -acetamido-indane 

To a solution of 5-acetamido-indane (lOg, 57.1 mmol) in glacial acetic acid (170 mL) cooled to 
ca. 10°C was added bromine (3.6 mL) over a period of lh, mantaining the temperature around 10°C. 
15 After stirring for an additional 10 minutes at 10°C, the mixture was diluted with water (600 mL). 

Stirring was continued until a solid precipitate had formed (15 min). The precipitate was filtered off and 
dissolved in CH 2 C1 2 (100 mL). A residual water phase was separated and the organic phase was dried 
over Na 2 S0 4 , filtered and concentrated to afford 12.45 g of the titile compound as light yellow solid. 

'H-NMR (CDCI3, 400MHz): 8 8. 14 (s, 1H), 7.52 (s, br, 1H), 7.38 (s, 1H), 2.88 (m, 4H), 2.10 (m, 

20 2H). 

Step 3: 6-Bromo -5 -acetamido-indan- 1 -one 

To a stirred solution of 6-bromo-5-acetamido-indane (12.45 g, 49 mmol) in acetic acid (170 mL) 
cooled to 15°C was added dropwise a solution of Cr0 3 (16.7 g, 167 mmol) in water (33 mL) over 25 
25 minutes, keeping the temperature between 15-17°C. The mixture was stirred for an additional 90 min. at 
16-18°C and then water (520 mL) was added. Stirring was continued for 30 minutes at 5°C. The formed 
precipitate was filtered off, washed with cold water and dried under high vacuum for 16 hours. The title 
compound was obtained obtained as a light yellow solid (8.0 g). 

'H-NMR (CDC1 3 , 400MHz): 5, 8.6O (s,lH), 7.94 (m, 2H), 3.10 (m, 2H), 2.71 (m, 2H), 2.31 (s, 

30 3H). 
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Step 4: 6-Broip n>5>acetaTTii do-4--iiitro-iiidan-l-Qne 

Nitric acid (65 mL) was cooled to -40°C Then 6-bromo-5-acetairrido-indan-l-one (8.0 g, 29.8 
mmol) was added portionwise over a period of 2 min. The resulting solution was stirred for 50 minutes, 
during which the temperature was allowed to raise gradually to -20°C. The reaction mixture was poured 
5 into water (1. 1 L). The formed precipitate was filtered off, washed with water, dried first under air stream 
and then under high vacuum for 16 h. The title compound was obtained as a light yellow solid (7.5g). 

'H-NMR (DMSO-d 6 , 400MHz): 5 10.43 (s, 1H), 8.19 (s, 1H), 3.18 (m, 2H), 2.73 (m, 2H), 2.09 

(s, 3H). 

10 Step 5: 6-Bromo-5-amino^-nitro-indan-l-one 

A suspension of 6-bromo-5 -acetamido~4-nitro-indan- 1 -one (3 g, 9.6 mmol) in methanol (150 mL) 
and aqueous HC1 (120 mL, 6.25 N) was refluxed for 2.5. After cooling to room temperature the mixture 
was poured into water (1.8 L) and stirred for 5 minutes at room temperature. The formed precipitate was 
filtered off and washed with water (200 mL). After drying for 24 hours under high vacuum 1 .9 g of the 
15 title compound were obtained as an orange solid. 

'H-NMR (DMSO-d 6 , 400MHz): 5 7.95 (s, 1H), 7.79 (s, 3H), 3.30 (m, 2H), 2.55 (m, 2H). 

Step 6: 4.5- raamina- indan-l-one 

A suspension of 6-Bromo-5 -amino-4-nitro-indan- 1 -one (13.9 g, 51 mmol), NaOAc (32 g) and 
20 palladium on charcoal (10 %, 2.55 g) in 1 L anhydrous EtOAc was stirred under a hydrogen atmosphere 
at atmospheric pressure for 24 hours. Further catalyst (1 g) was added and stirring was continoued for 24 
hours. The catalyst was filtered off and washed with EtOH (800 mL). The combined organic phases 
were concentrated to ca 50 mL and partitioned between CH 2 C1 2 (500 mL) and 5% NaHC0 3 (500 mL). 
The organic phase was dried over Na 2 S0 4 , filtered and concentarted to dryness. The title compound was 
25 obtained an orange solid (5.3 g). 

'H-NMR (DMSO-d* 400MHz): 8 6.80 (d, J=10.0 Hz, 1H), 6.55 (d, J=10.0 Hz, 1H), 5.45 (s, 2H), 
4.53 (s, 2H), 2.70(m, 2H), 2.49(partially covered by DMSO, m, 2H). 

Step 7: 7.8-Dihvdrn mdenor4.5-JirL231triazol-6(3/n-one 

30 Under warming 4,5-diaminoindan-l-one (1.0 g, 6.2 mmol) were dissolved in 80 ml EtOH. After 

cooling to room temperature 6.1 mL of 37 % HQ and 1.5 mL of water were added. The solution was 
cooled to 0°C and a solution of 1.38 g Na N0 2 in 6.5 mL water were added dropwise. The resulting dark 
brown mixture was stirred for further 50 minutes at ca 5°C. The mixture was partitioned between 500 
mL EtOAc and 400 mL water. The organic phase was washed with 300 mL brine, dried over NaS0 4 and 

35 the solvent was distilled off under vacuum. The product was purified by flash chromatography on a 
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silicagel, using EtOAc/petrolether (1:1, v/v) as eluent The title compound was obtained as an orange 
solid (632 mg). 

MS:m/zl74for[M+H] + . 

'H-NMR: (DMSO-d*, 400 MHz) 5 (ppm) 7.85 (d, 8.5 Hz, 1H), 7.64 (d, 8.5 Hz, 1H), 3.45 (m, 
5 2H), 2.77 (m, 2H). 

Step 8: 7^4-Fluoro-ben2^Udeney7.8^ 

7,8-Dihydroindeno[4,5-d][l,2,3]triazol-6(3fl)-one (200 mg, 1.16 mmol) were dissolved in 17 ml 
EtOH under warming. After cooling to room temperature 139 pL (1 .5 eq.) of 4-fluoro-benzaldehyde and 
10 5.0 mL of 0.5 M NaOMe in MeOH (1.5 eq.) were added. The reaction mixture was stirred at room 

temperature for 17 h. Under stirring 3.7 mL of 1 N HQ and 200 mL of water were added. The formed 
precipitate was filtered off and dried under high vacuum. 260 mg of the title compound were obtained. 
MS: m/z 280 for [M+H] + . 

X H-NMR (DMSO-d* 400 MHz): 5 (ppm) 16.3 (s, br, ca.lH), 8.00-7.85 (m, 3H), 7.81 (m, 1H), 
15 7.61 (s, 1H), 7.39 (m, 2H), 4.49 (s, 2H). 

Step 9: 7^4-Huoro-benzviy7.8-dihvdre^ 

7-(4-Huoro-benzylidene>7,8K^ (239 mg, 0.86 mmol) 

were dissolved 30 ml EtOAc-EtOH (1:1, v/v) under warming. After cooling to room temperature 48 mg 
20 of Pd/C (10%) were added. The reaction mixture was stirred under hydrogen at atmospheric pressure for 
6 h at 45 °C. The catalyst was filtered off and the solvent was distilled off under reduced pressure. 7-(4- 
. Huoro-benzylH>8Klihydroindeno[4,5^[l,2,3]triazol-6(3//)-one (253 mg) was obtained as a colourless 
oil and was used in the next step without further purification. 
MS: m/z 282 for [M+H] + . 

25 

Step 10: 9a-(4- Fluoro-benzvn-6>methvl-8,9.9a. 10-tetrahvdrofluoienori,2diria31-triazol-7f3f/Vone 

7<4-Huoro-benzyl)-7,8Kiihydroindeno[4,5^[l,2,3]tiiazol^(3^K>ne (253 mg, 0.86 mmol) were 
dissolved in 3 mL of 0.5M NaOMe/MeOH. EVK (127yL, 1.5 eq.) was added, the mixture was heated to 
60°C for 5 h. The reaction mixture was cooled to room temperature and partitioned between 150 mL 

30 EtOAc and 100 mL 1 M aqueous KH 2 P0 4 . The organic phase was dried over NaS0 4 , filtered and the 

solvent was didtilled off under reduced pressure. An oily material was obtained, which was dissolved in a 
mixture of 5mL HO Ac and 5mL 6 N HCL The mixture stirred and heated to 100°C fori h. After cooling 
to room temperature and the mixture was partitioned between EtOAC and mL 1 M aqueous KH 2 P0 4 . The 
organic phase was dried over Na 2 S0 4 and filtered. After removal of the solvent by distillation under 

35 reduced pressure the crude product was purified flash-chromatography on silicagel (eluent: EtO Ac/petrol 
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ether 1:2, v/v). The pooled product fractions were dissolved in 10 mL of 0.5M NaOMe/MeOH and the 
mixture was stirred and heated to 50°C for 40 min. After cooling to room temperature and the mixture 
was partitioned between EtOAC and mL 1 M aqueous KH 2 P0 4 . The organic phase was dried over 
Na 2 S0 4 and filtered. After removal of the solvent by distillation under reduced pressure the crude 
5 product was dissolved methyl-ter butyl ether (10 mL) and precipitated by addition of petroleum ether 

(35 mL). The precipitate was washed with petroleum ether and dried under high vacuum to afford 9a-(4- 
fluoro-benzyl)-6-methyl-8,9,^ (38 m g) as a pale 

yellow solid. 

MS: m/z 348.0 for [M+H] + , 

10 J H-NMR (MeOD-d4, 400 MHz): 6 2.21 (s, 3H), 2.22-2.30 (m, 1H), 2.33-2.40 (m, 1H), 2.53-2.62 

(m, 1H), 2.85-2,98 (m, 3H), 3.02 (d, J=16.8 Hz, 1H), 3.58 (d, J=16.8 Hz, 1H), 6.74 (m, 2H), 7.01 (m, 2H), 
7.73 (d, J=8.4 Hz, 1H), 7.84 (d, J=8.4 Hz, 1H). 



-76- 



WO 2005/040212 



PCT/EP2004/011683 



The following compounds were prepared using methods analogous to those described in the preceding 
examples: 

R 3 X 




R 1 

5 Table la (antagonists): 



CM 
P 


R1 


R2 


R3 


X 


MS 


GST- 
ERct 
MG- 
LBD 
IC50 
[nM] 


GST- 

ERa 

wt- 

LBD 

IC50 

[nM] 


hER 

a 
!C50 
[nM] 


hER|3 
IC50 
[nM] 


Name 


6 


H 


benzyl 




o 


383.2 


29 


1371 


223 


96 


9a-benzyl-7-hydroxy- 
4-(4-hydroxyphenyl)- 
1 ,2,9,9a-tetranydro- 
3H-fluoren-3-one 


7 


H 


benzyl 


n-butyl 


o 


347.2 


25 


1306 


259 


307 


9a-benzyl-4-butyl-7- 
hydroxy-1 ,2,9,9a- 
tetrahydro-3H- 
fluoren-3-one 


8 


H 


4- 
chloro- 
benzyl 




o 


528.2 


29 


4436 


1085 


2232 


9a-(4-chlorobenzyl)- 
7-hydroxy-4-{4-[2-(1- 
piperidinyl)- 
ethoxyjpheny!}- 
1 ,2,9,9a-tetrahydro- 
3H-fluoren-3-one 


10 


H 


benzyl 


l-^y^OCHaCHaN^) 


o 


494.4 


22 


1272 


375 


184 


9a-benzyl-7-hydroxy- 
4-{4-[2-(1- 
piperidinyl)- 
ethoxyjphenyl}- 
1 ,2,9,9a-tetrahydro- 
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3H-fluoren-3-one 


11. 


H 


benzyl 


|-^~^-0CHsCH 2 N^ 


o 


495.8 


25 


2308 


706 


231 


9a-benzyl-7-hydroxy- 
4-{4-[2-(4- 
morpholinyl)- 
ethoxy]phenyl}- 
1 ,2,9,9a-tetrahydro- 
3H-fluoren-3-one 


12 


H 


benzyl 




o 


453.8 


40 


4171 


1196 


555 


9a-benzyl-4-{4-[2- 
(dimethylamino)ethox 
y]phenyl}-7-hydroxy- 
1 ,2,9,9a-tetrahydro- 

3H-fluoren-3-one 


13 


H 


benzyl 




o 


367.6 


2022 


NAat 
3600 
nM 


n.d. 


n.d. 


9a-benzyl-7-hydroxy- 

4-pyridin-2-yl- 
1 ,2,9,9a-tetrahydro- 
3H-fluoren-3-one 


14 


H 


benzyl 


^> 


o 


368.0 


471 


NAat 
3600 
nM 


n.d. 


n.d. 


9a-benzyl-7-hydroxy- 

4-pyndm-3-yl- 
1 ,2,9,9a-tetrahydro- 
3H-fluoren-3-one 


15 


H 


benzyl 




o 


368.0 


79 


725 


1182 


558 


9a-benzyl-7-hydroxy- 

4-pyridin-4-yl- 
1 ,2,9,9a-tetrahydro- 
3H-fluoren-3-one 
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Table lb (agonists): 



CM 
P 


R1 


R2 


R3 


X 


MS 


GST- 
ERcc 
MG- 
LBD 
IC50 
[nM] 


GST- 
ERa 
wt-LBD 
IC50 
[nM] 


hER 

a 
IC50 
[nM] 


hERp 
IC50 
[nM] 


Name 


4 


H 


benzyl 


Me 


o 


n.d. 


13.7 


2320 


1470 


506 


9a-benzyl-7- 
hyd roxy-4- methy I- 
1 ,2,9,9a-tetrahydro- 
3H-fluoren-3-one 


16 


H 




Me 


o 


323.2 


14 


n.d. 


n.d. 


n.d. 


9a-(4-fluorobenzyl)- 
7-hydroxy-4-methyl- 
1 ,2,9,9a-tetrahydro- 
3H-fluoren-3-one 


17 


H 




Me 


o 


340.4 


21.7 


6830 


n.d. 


n.d. 


9a-(4-chlorobenzyl)- 
7-hydroxy-4-methyl- 
1 ,2,9,9a-tetrahydro- 
3%{H}-fluoren-3-one 


18 


H 




Me 


o 


339.0 


15 


n.d. 


n.d. 


n.d. 


9a-(3-chlorobenzyl)- 
7-hydroxy-4-methyl- 
1 ,2,9,9a-tetrahydro- 
3%{H}-fluoren-3-one 


19 


H 


CI 


Me 


o 


339.2 


66.6 


n.d. 


n.d. 


n.d. 


9a-(2-chlorobenzyl)- 
7-hydroxy-4-methyl- 
1 ,2,9,9a-tetrahydro- 
3%{H}-fluoren-3-one 
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20 


H 




Me 


O 


321.2 


85 


n.d. 


n.d. 


n.d. 


7-hydroxy-9a-(4- 
hydroxybenzy!)-4- 

methyl-1 ,2,9,9a- 
tetrahydro-3%{H}- 
fluoren-3-one 


21 


H 




Me 


o 


332.8 


11.4 


n.d. 


n.d. 


n.d. 


7-hydroxy-4-methyl- 
9a-(3-phenylpropyl)- 
1 ,2,9,9a-tetrahydro 
3%{H}-fluoren-3-one 


22 


H 


benzyl 


Me 


N-OMe 


n.d. 


60 


n.d. 


n.d. 


n.d. 


(3%{E})-9a-benzyl- 
7-hydroxy-4-methyl- 
1 ,2,9,9a-tetrahydro- 
3%{H}-f'Uoren-3-one 

%{0}-methyloxirne 


23 


H 


benzyl 


Me 


N-OH 


n.d. 


199 


n.d. 


n.d. 


n.d. 


9a-benzyI-8-bromo- 
7-hyd roxy-4-methyl- 
1 ,2,9,9a-tetrahydro- 
3%{H}-fluoren-3-one 
oxime 


24 


Br 


benzyl 


Me 


O 


383.0/ 
384.4 


436 


NAat 
180000 
nM 


n.d. 


n.d. 


9a-benzyl-6,8- 
dibromo-7-hydroxy- 
4-methyl-1 ,2,9,9a- 
tetrahydro-3%{H}- 

fluoren-3-one 



5 
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Table 2: 



CM 
P 


R2 


R3 


X 


MS 


GST- 
ERoc 
MG- 
LBD 
IC50 
[nM] 


GST- 
ERa 
wt- 
LBD 
IC50 
[nM] 


hERa 

IC50 

[nM] 


hERp 
IC50 
[nM] 


Name 


1 


benzyl 


H 


o 


315.0 


3900 


>10000 


>10000 


>10000 


9a-benzyl -8,9,9a, 10- 
iciiai lyuruir iu6nu[^, i- 
e]indazol-7(3H)-one 


5 


benzyl 


Me 


o 


329.0 


155 


>10000 


>10000 


1247 


6-methyl-9a-benzyl - 

8,9,9a,10- 
tetrahydroindeno[2,1- 
e]indazol-7(3H)-one 


9 


R2: benzyl 
R3: 

!-^~y^OCH 2 CH2N^> 


o 


518.2 


107 


1126 


622 


125 


9a-benzyl-6-{4-[2-(1- 
piperidinyl)ethoxy]phenyl 

}-8,9,9a,10- 
! tetrahydroindeno[2,1 - 

e]indazol-7(3H)-one 
hydrotrifluoroacetae salt 


25 




Me 


o 


347.0 


82 


n.d. 


>10000 


997 


6-methyl-9a-(4- 
fluorobenzyl) -8,9,9a,10- 
tetrahydroindeno[2, 1 - 
e]indazol-7(3H)-one 


26 


F 


Me 


o 


347.0 


444 


n.d. 


>10000 


1068 


6-methyl-9a-(3- 
fluorobenzyl) -8,9,9a,10- 
tetrahydroindeno[2, 1 - 
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e]indazol-7(3H)-one 


27 


F 


Me 


O 


347.0 


135 


n.d. 


4298 


529* 


6-methyI-9a-(2- 
fluorobenzyl) -8,9,9a,10- 
tetrahydroindeno[2,1- 
e]indazol-7(3H)-one 


28 




Me 


O 


n.d. 


5500 


n.d. 


n.d. 


n.d. 


6-methyI-9a-(2- 
naphthylmethyl) - 
8,9,9a,10- 
tetrahydroindeno[2,1 - 
e]indazol-7(3H)-one 


29 




Me 


o 


335.2 


146 


n.d. 


n.d. 


n.d. 


6-methy!-9a- 
cyclohexylmethyl - 
8,9,9a,10- 
tetrahydroindeno[2,1 - 
e]indazol-7(3H)-one 




5 Table 3: 



CMP 


R2 


R3 


X 


MS 


GST- 


GST- 


hERa 


hERp 


Name 












ERa 


ERa 


IC50 


IC50 














MG- 


wt- 


[nM] 


[nM] 














LBD 


LBD 


















IC50 


IC50 


















[nM] 


[nM] 
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30 


benzyl 


Me 


O 


330.O 


1163 


n.d. 


>1000 
0 


4186 


9a-benzyl-6-methyl- 
8,9,9a,10- 
tetrahydrof luoreno[1 ,2- 
d][1 ,2,31triazol-7(3H}-one 


31 




Me 


O 


348.0 


340 


n.d. 


>1000 
0 


>10000 


9a-(4-fIuorobenzyl)-6- 
methyl-8,9,9a,10- 

f ■ |_ | _f ft m M M m^, 

tetrahydrofluoreno[1 ,2- 
d][1 ,2,3]triazol-7(3H)-one 


32 


F 


Me 


O 


348.2 


2450 


n.d. 


n.d. 


n.d. 


9a-(2-fluorobenzyl)-6- 

methyl-8,9,9a,10- 
tetrahydrof luoreno[1 ,2- 
d][1 ,2,31triazo!-7(3H)-one 


33 




Me 


O 


408.0 
/410. 
0 


880 


n.d. 


n.d. 


n.d. 


9a-(4-bromobenzy!)-6- 

rnethyl-S^^a.lO- 
tetrahydrof luoreno[1 ,2- 
d]ri ,2,31triazol-7(3H)-one 


34 


F 3 C 


Me 


O 


398.0 


4100 


n.d. 


>1000 
0 


>10000 


9a-[2- 
(trifluoromethyl)benzyl]- 

6-methyl-8,9,9a,10- 
tetrahydrofluoreno[1 ,2- 
d][1 ,2,31triazo!-7(3H)-one 


35 




Me 


O 


398.0 


6900 


n.d. 


n.d. 


n.d. 


9a-[4- 
(trifluoromethyl)benzyl]- 

6-methyl-8,9,9a,10- 
tetrahydrofluoreno[1 ,2- 
d][1 s 2,31triazol-7(3H)-one 


36 




Me 


O 


336.0 


4000 


n.d. 


n.d. 


n.d. 


9a-(thien-2-ylmethyl)-6- 
methyl-8,9,9a,10- 

t^trsihvrlmfli mronnM Q_ 
icu cii iyui uiiuui d i\J[ I , 

d][1 ,2,3]triazol-7(3H}-one 


37 




Me 


O 


336.0 


2300 


n.d. 


n.d. 


n.d. 


9a-(thien-2-ylmethyl)-6- 

methyl-8,9,9a,10- 
tetrahydrofluoreno[1 ,2- 
d][1 ,2,3]triazol-7(3H}-one 
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38 




Me 


O 


344.0 


3300 


n.d. 


n.d. 


n.d. 


9a-(1 -pheny!ethyI)-6- 


















methyl-8,9,9a,10- 
tetrahydrof luoreno[1 ,2- 
d][1 ,2,3]triazol-7(3H}-one 



Example 7 

5 Design and Construction of an ER-LBD Mutant Library 

Because it had previously been shown to work in the context of chimeric constructs, the isolated 
ERa-LBD represented an attractive candidate to develop the veneered LBD needed for a truly orthogonal 
transcriptional switch. Different from its natural isoforms, the veneered LBD should be unable to bind 
the ligands of ER, such as estradiol, but able to bind an inactive analogue of the ligands. Although it is 

10 structurally related to the ligands, the inactive analogue is unable to bind the natural estrogen receptor. 

We examined possible candidates for the inactive analogue according to the following criteria. 
First, the candidates should be inactive compounds (against both hERa and hERP) within an otherwise 
generally active series. Second, the general structures of the candidates should require only a limited 
modification around the ligand-binding pocket of hERa LBD. Third, the candidates preferably show a 

15 generally acceptable pharmacokinetic profile. 

According to these criteria it was decided to synthesize CMP1 (Fig. 1A), a compound with a 
large benzyl substitution at the 9a position. Choice of the 9a position was motivated by the observation 
that modification at this position in a series of hERP-selective tetrahydrofluorenone ligands developed 
from an in-house program generally caused a strong reduction in binding affinity. Furthermore, 

20 replacement of the phenolic hydroxy group by a pyrazole heterocycle was known to improve the 

pharmacological properties of the active series. The newly synthesized CMP1 indeed displayed a very 
poor binding affinity for both hERa and hER0 (IC 50 values >10 4 nM, respectively, Fig. 1A) and 
represented therefore a promising starting point around which a receptor mutagenesis strategy could be 
designed. 

25 According to the X ray crystal structures of agonist-bound hERa and hERP LBDs, CMP1 was 

modelled into the hERa and hERP binding pockets assuming three distinct major conformations for the 
9a benzyl substituent (R 2 , table 2) predicted by energy calculations (Fig. 3). Five residues within the 
ligand-binding pocket were identified as the most likely candidate residues interfering with binding of 
CMP1: L391/L343, F404/F356, M421/I373, 1424/1376, L428/L380 of hERa or hERP, respectively. A 



-84- 



WO 2005/040212 PC T/EP2004/0 11683 



mutant library including these five positions should cover nearly 300° of conformational space of the 9a 
benzyl rotamer (R 2 , table 2), including all three major conformations of the benzyl substituent. 

Because the interference with binding of CMP1 was expected to arise from steric clashes between 
the side-chains and the 9a benzyl moiety (R 2 , table 2), substitutions into smaller or, more generally, 
5 different residues should remove this hinderance. Due to the generally hydrophobic character of both the 
binding pocket and the benzyl substituent, we decided to use only Gly, Ala, Cys, Val, He, Leu, Met, Phe, 
Tyr or Trp as possible substitutions at the five positions. 

The analysis further showed that differences between hERa and hERfi affecting binding of 
CMP1 should be restricted to only two residues, L384/M336 and M421/I373 of hERoc and hER(3, 
10 respectively. Since the pyrazole compound series showed preferential binding to hER0 (Fig. 1 A), we 
decided to generate the library in the context of a L(384)M mutated hERa. The second residue, M421, 
corresponded to one of the positions mutated in the library and inclusion of He as a possible substitution 
for M421 therefore implicitly generated a library containing also the wt hER{3 ligand-binding pocket. 

Choice of hERa over hERP was motivated by two factors. First, the biology of hERa was much 
15 better known. Second, related successful experiments using the hERa ligand-binding domain (LBD) had 
already been described in the literature. In vitro experiments using GST-fusion constructs with single or 
combined L(384)M and M(421)I variants of the hERa LBD reproduced the binding data observed for the 
full-length wt hERP (Fig. 1 A). 

20 Example 8 

Validation of the Library Complexity in the Context of a Yeast Selection System 
hi order to genetically select for CMPl-responsive variants, the L384M mutated hERa LBD was 
fused to the GAL4 DNA-binding domain (DBD) and the VP16 transactivation domain (Fig. 2A). The 
responsiveness of this chimera to the pyrazole series of compounds was determined through a titration 

25 curve against estradiol (Ej) (Fig. 2B) or the active compound CMP2 (Fig. 2C). As before, the L(384)M 
substitution was both necessary and sufficient to modulate the affinity for the chimeric transcription 
factor. In addition, it demonstrated that the pyrazole series of compounds was able to penetrate the yeast 
membrane, a property that is frequently absent tbms impeding the use of the yeast system as a genetic 
selector. 

30 Generation of the library of hERa-LBD variants took advantage of the inherent yeast 

recombination system. Two oligonucleotides degenerate for 10 different amino acids at each of the five 
library positions were synthesized using a di-nucleotide assembly strategy, including sufficiendy long 
(>30bp) constant flanking regions to direct the homologous recombination. 
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To decrease the likelihood that clones with single- or double-mutations would be lost during the 
synthesis, construction or selection of the library, the concentration of wt codons was raised to 15% for 
each of the mutated positions. As a consequence the relative abundance of 1- or 2- residue variants was 
significantly raised (by about 8- and 4-fold, respectively) compared to only a minor reduction in the 
5 relative abundance of 5- residue variants (factor of about 0.8). This strategy was also in line with our 
expectation that predominantly 1- or 2-residue variants rather than more complex variants would be 
necessary to accommodate the CMP1 ligand, and, at the same time, be tolerated by the LBD structural 
framework. 

Preparative amounts of the mutated LBD fragments were synthesised by PCR amplification of 
10 hER LBD template using the degenerated oligonucleotide mix. The mutated fragment collection was 
then included in a scaled-up co-transformation experiment together with a linearised recipient L(384)M 
hER LBD vector. Approximately 10 6 colonies corresponding to a 10-fold library redundancy were 
plated, and 12 randomly chosen clones revealed the expected library complexity (data not shown). 

In order to probe the library for selection capacity, -His/-Trp growth-selective plates were treated 
15 with increasing amounts of the hERP-selective CMP3. Sequencing of 9 randomly picked clones 

indicated that, different from the non-selected clones, most of the CMP3-selected clones had either zero 
or few amino acid substitutions in all five mutagenized positions, being L391 and M421 residues 
completely conserved (data not shown). It appears therefore that a very restricted number of mutations in 
the hER binding pocket is tolerated to maintain a productive interaction with a high affinity ligand of the 
20 wt receptor. No clones containing the M(421)I substitution were selected, consistent with the observation 
that no further increase in the binding affinity for CMP3 was observed by adding this second substitution 
to the L(384)M mutated protein (Fig. 1 A). 
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Example 9 

Genetic Library Screening and Analysis of Selected ER-LBD Mutated Variants 
Having confirmed the expected composition and responsiveness of the library, we repeated the 
growth selection in the presence of the inactive analogue CMP1 at a concentration of 1 jiM. 1600 
5 independent clones were subsequently screened against 1 pM CMP1 or DMSO, as a control, on -Trp/X- 
gal-containing plates. 54 independent clones were judged positive for (5-Galactosidase trans-activation in 
this experiment and 28 amongst them retained positivity also in the presence of 0.1 ptM CMP1. A parallel 
experiment was carried out using the same strategy but performing the initial growth selection and first 
round of white/blue screening in the presence of a 10-fold higher compound concentration (10 ^M). 31 
10 clones retained positivity when challenged with 0. 1 pM CMP1 and were therefore selected for further 
characterization together with the 28 positive clones of the former experiment. 

DNA sequence analysis revealed a consensus sequence of the selected mutant variants (Fig. 4 and 
SEQ ID NO: 3-15). The most prominent feature was the mutation of M421 into a residue containing a 
smaller side chain (mostly G and few A) that occurred in 86% of the selected clones. Importantly, this 
15 mutation was never observed in clones selected using the active pyrazole compound CMP3 (data not 

shown), thus excluding a bias in the screening procedure. An isolated M421G mutation was found with a 
relatively low frequency and a second mutation of 1424 to either M, V or L was present in most of the 
clones. In addition, a third mutation of F404 to W was present in 28% of the mutated clones, whereas 
positions L391 and L428 were generally conserved. The consensus sequence present in most selected 
20 variants is consistent with a model in which the benzyl substituent of CMP1 is directed towards the 
positions originally occupied by M421 and 1424 (Fig. 4). 

Western blot analysis of yeast protein extracts from all representative clones shown in Fig. 4 
using a monoclonal antibody directed against VP16 AD did not show significant differences in the 
expression levels of the corresponding chimeric proteins (not shown). 
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Example 10 

Novel Tetrahvdrofluorenoiie Compounds Interact Selectively 
with ER-LBD Variants Mutated in M421 
Representative selected arrays of mutations shown in Fig. 4 were introduced in the pGEX-hERa- 
5 L(384)M-LBD prokaryotic expression construct and the corresponding GST-fusion protein variants were 
expressed in R coll Suitable aliquots of crude bacterial extracts containing comparable amounts of all 
different protein variants were tested for direct binding in vitro to increasing concentrations of 3 [H|- 
estradiol. All variants bound the natural ligand E2 with a reduced affinity compared to the hERa- 
L(384)M-LBD protein (wt in all 5 mutagenized positions) (data not shown). The single M(421)G- 
10 selected mutant bound estradiol with the highest affinity with a kd value approximately 9-fold higher than 
wt (46 nM and 5 nM, respectively) (Fig.lA). The kd value for the hormone was nevertheless sufficiently 
low to measure the affinity of this polypeptide for a series of novel tetrahydrofluorenone compounds 
including CMP1, CMP4, CMP5 (Fig. 1A) and all those listed in tables la, lb, 2, and 3 in 3 [H] E2- 
displacement assays. A glycine in position 421 was confirmed to confer selectivity of binding to the 
15 majority of the compounds tested (Fig. 1A and tables lb, 2, 3). The affinity for tetrahydrofluorenones 
containing a phenol group was significantly higher than for those containing a pyrazole substituent. 
Furthermore, the binding affinity was higher for compounds with a methyl group in position 4 than for 
those containing a hydrogen atom in the corresponding position. 

20 Example 11 

ER-LBD Variants Mutated in M421 Show Fluorenone-Denendent 
Transcriptional Activity 
A series of representative selected mutants was transferred in the yeast strain Y187, in which the 
integrated lacZ reporter gene is under the control of the intact GAL1 promoter and is therefore more 
25 tightly regulated and more efficiently expressed than in the yeast strain CG-1945 (the librajy^ontaining 
strain). GAL4/VP16 chimeras containing ER LSD L(384)M with or without the additional selected 
mutation M(421)G were tested in ligand-dependent 0-Galactosidase trans-activation experiments in the 
presence of E2, CMP4 or CMP5 compounds (Fig. 1A). Representative experiments are shown in Figures 
5A-5C. Our results confirmed that the mutant containing a glycine in position 421 was selectively 
30 activated by both CMP4 and CMP5 cognate ligands with EC50 values of 36 nM and 217 nM, respectively, 
comparable with the IC50 values measured in vitro (Fig. 1 A). 

We also reversed the original mutation L(384)M in the context of the M(421)G selected mutant, 
thus generating a single-substituted ERa LBD MC421)G chimeric protein. The transcriptional activity 
induced in yeast by the pyrazole compound CMP5 was strongly impaired in the absence of the L(384)M 
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mutation (data not shown), indicating that the combination of the two amino acid substitutions was 
necessary to define the ligand specificity of the selected mutant. 

For what concerns the response to estradiol, the EC 5 o value measured with the M(421)G mutant 
(1.5 nM) was approximately eight-fold higher than that associated with the parental clone (0.2 nM), a 
5 difference of the same order of magnitude of that measured in vitro (see Fig. 1 A). 

Example 12 

ER-LBP Variants Bearing D(351 WH524V Mutations Respond Poorly 
to Estradiol and Show Low Ligand-Independent Activity 

10 At this point we had accomplished the first step obtaining ER-LBD variants with the desired shift 

in specificity towards the inactive analogues. To make the system practically useful, two additional 
essential properties had to be introduced. First, the L(384)M/M(421)G (MG)-LBD variant should be 
orthogonal against the natural ligand estradiol. Second, the ligand-independent transcriptional activity 
should be reduced to background levels. 

15 We targeted amino acid residues which make contacts with the D-ring of estradiol in the crystal 

structure of the hER LBD-hormone complex. Since tetrahydrofluorenone compounds formally lack a 
structural equivalent of the hormone D-ring, we reasoned that alterations in the D-ring-interacting region 
of the LBD should not severely compromise binding to these ligands. H1S524, hydrogen-bonded to the 17- 
hydroxyl of estradiol was therefore mutated to Val in the context of MG-LBD variant Furthermore, 

20 Gly521, which makes hydrophobic van der Waals contacts with the D-ring of estradiol, was also 

substituted with Val, Leu, Met, or Arg. The corresponding GAL4/VP16 chimeric proteins harboring 
these new mutations were challenged in ligand-dependent ji-Galactosidase transcription assays in the 
presence of either estradiol or ligand CMP4. 

Although all substitutions of Gly 52 i resulted in a significant decrease of both basal and estradiol- 

25 induced trans-activation levels in yeast, they also caused a dramatic loss of response to the fluorenone 

compound CMP4 (data not shown). In contrast, H524V displayed a strongly reduced affinity for estradiol 
of about 200-fold (Fig. 6A) and a concomitant reduction in affinity of only 3-fold for ligand CMP4 (Fig. 
6B). 

A third and important step to devise a transcription factor aimed for the regulation of trans-gene 
30 expression in vivo was to minimize its background activity in the absence of added ligand. The MG-LBD 
"lead" selected variant containing the additional H(524)V mutation described above still retained relative 
high constitutive activity levels (Fig. 7A). 

Published data indicated that amino acid substitutions which abolish the negative charge of 
Asp3si in the context of full-length hERcc determine a significant decrease of its basal activity without 
35 interfering with any ligand-dependent response. Therefore, a D(35 1)A mutation was inserted in the MG- 
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LBD "lead" selected variant containing the additional H(524)V mutation. The presence of the D(351)A 
mutation determined a 14-fold reduction of ligand-iadependent activity with a concomitant significant 
increase of the maximal fold-induction by the cognate ligand CMP4 (Fig. 7A). 

Having fixed an airay of mutations within hERa-LBD, [D(351)A, L(384)M, M(421)G, H(524)V] 
which changed the specificity of activation in response to ligands in yeast, we wished to verify that this 
altered specificity was maintained in mammalian cells. Therefore, we transferred both the mutated and 
wt GAL4/hERa-LBD/VP 1 6 sequences in a mammalian expression vector and evaluated their ligand- 
dependent transcriptional activity by co-transfection in HeLa cells with a GAL4-responsive reporter 
plasmid expressing SEAP under the control of five GAL4 UAS repeats (5GAL4UAS-pSEAP). The 
ability of the mutant chimeric transcription factor to mediate reporter gene activation in response to the 
cognate ligand CMP4 and estradiol was compared with the response of the chimera containing the wt 
hERa-LBD. 

The result shown in Fig. 7B demonstrates that the constitutive activity of the mutant was as low 
as that of wt. Maximal activity levels shown by the mutant in response to saturating concentrations of 
CMP4 were also comparable with those obtained with wt in response to saturating concentrations of 
estradiol (60-90 fold induction). Furthermore, the maximal response of the mutant to saturating 
concentrations of estradiol was significantly impaired (10 % of that shown by wt). 

Example 13 

ER-LBD Variants Bearing a G521R Mutation Respond Poorly to Estradiol. Show Low Ligand- 
Independent Activity and Can be Induced bv Antagonistic Compounds 
In addition to the H524V variant, we followed a different strategy taking advantage of the well- 
known biological properties of the ER-LBD G(521)R substitution. Estrogen receptors bearing this 
mutation in their LBD exhibit both the desired properties: a low basal transcriptional activity and 
strongly reduced affinity for the estradiol ligand. Since our experiments showed the incompatibility of 
the agonistic series of fluorenone compounds with the G(521)R mutation, possible modifications of the 
ligand were explored. The rationale behind this reasoning was the notion that the G(521)R-substituted 
MG-LBD variant, as described for the wt ER G(521)R, was still proficient in the response to 4- 
hydroxytamoxifen (4-OH Tarn), an ER antagonist (Fig. 8A). More generally, ligands having an 
"antagonistic tail" apparently can compensate for loss of affinity introduced by G(521)R in the agonist 
binding pocket. A new series of compounds was therefore generated by substituting the methyl group at 
the R 3 position with more bulky side-chains mimicking an "antagonistic tail" (Fig. IB and table la). Both 
pyrazole (CMP9, Fig. IB. See also table 2) and phenol derivatives (Fig. IB and table la) containing 
different R 3 side-chains were challenged in competitive radiometric in vitro assays for binding to the MG- 
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LBD selected mutant and to ERa wt-LBD polypeptides. All compounds showed a significant selectivity 
for the mutant relative to the wt LBD and to both a and 0 full-length ER, albeit to different extent. 
Selectivity varied between 50 and 150 fold when the mutant and wt ERa LBD were compared, the 
differences being somewhat less pronounced on the full-length receptors. Although the presence of an 
5 antagonist side-chain apparently did not influence the binding affinity for the mutant when measured in 
vitro, no significant response to most of these compounds was detected in yeast transcriptional assays up 
to 10 |xM concentrations (not shown). This suggests that the uptake by yeast cells of compounds 
containing bulky, positive-charged side-chains is strongly impaired, as already observed for many known 
anti-estrogens. 

10 An exception was compound CMP6 containing a phenol substituent in R 3 which was able to 

significantly induce in a concentration-dependent fashion the transcriptional activity of the L(384)M, 
M(421)G, G(521)R variant in yeast cells (Fig. 8B). On the contrary, only a very weak response was 
observed for a variant containing only the L(384)M and G(521)R mutations and the compound was 
completely ineffective on the wt ERa LBD chimera up to 10 MM concentration. Similar results were 

15 obtained for CMP7 (Fig. 8C), demonstrating that a butyl substituent in R 3 is sufficient to confer an 

"antagonistic" nature to the ligand. More importantly, these data indicate that the specificity conferred by 
the ER-LBD M(421)G substitution for the phenyl group in the 9a position (R 2 , table la) of the ligand was 
preserved. As expected, the relative response levels were reversed when 4-OH Tarn was tested. The 
L(384)M, M(421)G, G(521)R variant showed indeed a significantly lower affinity for this antagonist 

20 compared with both the L(384)M and G(521)R and the wt ERa LBD chimera. 

Example 14 

Regulation of Gene Expression by Compounds of the Antagonistic Series 
The combination of the L384M/M421G/G521R ER-LBD variant with an antagonistic ligand thus 
25 exhibited the desired properties in order to be applicable in gene therapy applications. To test the system 
in mammalian cells, we therefore modified a ligand-dependent transcription regulator, called HEA-1, that 
we have recently generated. 

In HEA-l three elements are fused together the DNA-binding domain (DBD) of human HNF- 
lcc (aa 1-282), a G(521)R mutant of the LBD of the human ERa (aa 303-595), and a portion of the 
30 activation domain of the human p65 protein (aa 285-551). HEA-1 promotes transcription of transgenes 
downstream a multiple HNF-1 binding site in a stringent 4-OH Tarn dependent manner with up to 
hundred-fold drug-dependent transgene induction in ceD culture (Roscilli et al., 2002 Mol Ther. 5:653- 
663). Moreover, HEA-1 enables tight regulation of gene expression in vivo by treating mice with 
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Tamoxifen (TAM), which is predominantly metabolized to 4-OH Tarn in vivo (KEF) (Roscilli et al., 2002 
MolTher. 5:653-663). 

The intrinsic antagonistic activities of TAM and its metabolite 4-OH Tarn are however 
responsible for a number of side effects (such as increased incidence of uterine cancer) that become more 
5 pronounced during prolonged treatments and therefore might preclude the use of this system for long- 
term gene therapy applications. The use of inducer molecules with a very low affinity for endogenous 
ER-alpha and ER-beta should thus widen the potential application range of HEA-1 -based transcription 
regulatory systems. 

We therefore generated HEA-2 (SEQ ID NO: 43), which is a triple ER-LBD mutant 
10 L(384)M/M(421)G/G(521)R in the context of HEA-1, ajid tested its responsiveness to antagonist-type 
compounds. The amino acid sequence of HEA-2 is listed in SEQ ID NO 43. The corresponding cDNA 
was cloned into an eukaryotic expression vector and co-transfected in HeLa cells along with the reporter 
plasmid 7xHl/CRP/SEAP (Roscilli et al., 2002 Mol Ther. 5:653-663). Cells were treated with increasing 
concentrations of four antagonistic compounds and SEAP levels were measured in the culture medium 
15 after 24 hour treatment. 

The results showed that the chimera was stimulated by all four compounds in a dose-dependent 
manner. A representative dose-response curve with CMP8 compound is shown in Fig. 9A. hi the 
absence of treatment, a basal SEAP level of 0.8 ± 0.2 ng/ml was measured, similar to that measured in the 
culture medium of cells transfected only with the reporter plasmid (not shown). This indicated the 
20 absence of a significant basal activity in this experimental setting. About 1900 ng/ml of SEAP were 
measured at the highest concentration of CMP8, which corresponds to about 2000-fold increase as 
compared to the uninduced condition. Similar fold inductions were consistently observed in duplicate 
experiments and with all the four compounds (not shown). 

Fig. 9A also demonstrates that the chimera is activated by Ea only at the highest 
25 concentrations of the hormone (17-fold and 50-fold induction at 500 nM and 1 jiM E2). Consistent with 
the in vitro binding analysis (Fig. IB), CMP9 displayed the lowest activity with an EC50 of approximately 
300 nM, while the EC50 of the other compounds ranged from 1 1 to 40 nM (Fig. 9B). 

Other embodiments are within the following claims. While several embodiments have 
been shown and described, various modifications may be made without departing from the spirit and 
30 scope of the present invention. 
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